DESCRIPTION



METHOD AND REAGENT FOR THE INHIBITION OF CALCIUM
ACTIVATED CHLORIDE CHANNEL-1 (CLCA-1)



Background Of The Invention

The present invention concerns compounds, compositions, and methods for
the study, diagnosis, and treatment of conditions and diseases related to the
expression of CLCA (Cl⁻ Channel Ca²⁺-Activated) genes.

The following is a brief description of the current understanding of CLCAs. 
The discussion is not meant to be complete and is provided only for understanding 
the invention that follows. The summary is not an admission that any of the work 
described below is prior art to the claimed invention. 

CLCA proteins are emerging as a new class of channel proteins that mediate
Ca²⁺-activated Cl⁻ conductance in a variety of tissues. Members of the CLCA
family have been cloned, isolated, and partially characterized from human, bovine,
and murine species. These proteins demonstrate a high degree of homology in their
size, sequence, and predicted structure yet can vary considerably in tissue
distribution. Bovine CLCA1 (bCLCA1 or CaCC) was the first reported CLCA
homolog. The bCLCA1 protein, which was isolated from and is exclusively
detected in tracheal epithelial cells, functions as a Ca²⁺-activated Cl⁻ channel (Ran
and Benos, 1992, J. Biol. Chem., 267, 3618-3625; Cunningham et al., 1995, J. Biol.
Chem., 270, 31016-31026). Another bovine homolog, bovine lung-endothelial cell
adhesion molecule-1 (Lu-ECAM-1), appears to be involved in the preferential
metastasis of melanoma cells to the lung. Lu-ECAM-1 shares 92% nucleotide
identity with bCLCA1 and is expressed in vascular endothelial cells (Elble et al.,
1997, J. Biol. Chem., 272, 27853-27861). It has been shown that Lu-ECAM-1 can
mediate the binding of lung-metastatic mouse B16F10 melanoma cells to endothelial
cells (Zhu et al., 1992, J. Clin. Invest., 89, 1718-1724); however, due to sequence
similarity to bCLCA1, a role for Lu-ECAM-1 as a chloride channel has been
suggested (Elble et al., supra). The mouse homolog, mCLCA1, appears to have an
expression pattern similar to the cystic fibrosis transmembrane conductance
regulator (CFTR), with expression seen in various secretory epithelial cells,
squamous epithelia, and in some lymphocytes (Gruber et al., 1998, Histochem. Cell
Biol., 110, 43-49).

The three human CLCA homologs (hCLCA1, hCLCA2, and hCLCA3) thus
far cloned, isolated, and partially characterized all retain sequence homology and
similar cDNA length, and all are located on the short arm of chromosome 1
(1p22-p31). Human CLCA proteins show a restricted pattern of expression in differing
secretory tissues. Human CLCA1 was the first reported calcium activated chloride
channel in humans. The 31,902-bp hCLCA1 gene is located on chromosome
1p22-p31, contains 14 introns, and is preceded by a canonical promoter region that
contains an L1 transposable element. Expression of hCLCA1 is predominant in
intestinal basal crypt epithelia and goblet cells. A protein processing model has been
proposed for hCLCA1 in which the primary translation product (125-kDa) is cleaved
to a 90-kDa and a group of 37- to 41-kDa proteins, the latter apparently representing
different glycosylation products of the same polypeptide (Gruber et al., 1998,
Genomics, 54, 200-214). Transient expression of hCLCA1 cDNA in HEK 293 cells
is associated with an increase in whole-cell Ca²⁺-activated Cl⁻ conductance that is
susceptible to inhibition with anion channel blocking compounds. Cell attached
patch recordings of transfected cells in this study revealed single channels with a
slope conductance of 13.4 pS (Gruber et al., supra).

The hCLCA2 homolog is processed in a manner similar to hCLCA1,
resulting in the formation of a heterodimer consisting of a 90-kDa amino terminal
and an approximately 35-kDa carboxy terminal subunit with anchorage to the
plasma membrane via four or five transmembrane domains. Expression of hCLCA2
is somewhat less restricted than that of hCLCA1, being detected in human lung,
trachea, and breast tissue (Gruber et al., 1999, Am. J. Physiol., 276, C1261-C1270).
Human CLCA2 is expressed in normal breast epithelium but not in breast tumors of
different stages of progression, suggesting that hCLCA2 may act as a tumor
suppressor in breast cancer (Gruber et al., 1999, Cancer Res., 59, 5488-5491).
Human CLCA3 is a truncated, secreted member of the CLCA family which is
expressed in numerous tissues including lung, trachea, spleen, thymus, and breast
tissue. Unlike hCLCA1 and hCLCA2, which are processed into heterodimers,
hCLCA3 mRNA encodes a 37-kDa glycoprotein that corresponds to the N-terminal
extracellular domain of its homologs. When hCLCA3 is expressed in HEK 293 or
CHO cells, the 37-kDa glycoprotein is secreted (Gruber and Pauli, 1999, Biochim.
Biophys. Acta, 1444, 418-423).

Holroyd et al., International PCT publication No. WO 99/44620, describe a
calcium-activated chloride channel that is induced by IL-9.

Summary Of The Invention

The invention features novel nucleic acid-based techniques [e.g., enzymatic
nucleic acid molecules (ribozymes), antisense nucleic acids, 2-5A antisense
chimeras, triplex DNA, antisense nucleic acids containing RNA cleaving chemical
groups] and methods for their use to modulate the expression of CLCA (Cl⁻ Channel
Ca²⁺-Activated) genes.

In a preferred embodiment, the invention features the use of one or more of the 
nucleic acid-based techniques independently or in combination to inhibit the 
expression of the genes encoding hCLCA1, hCLCA2, hCLCA3, and hCLCA4.

Specifically, the invention features the use of nucleic acid-based techniques to
specifically inhibit the expression of CLCA1 (GenBank accession Nos.
NM_001285, AF039400, AF039401, AF127036), CLCA2 (GenBank accession No.
NM_006536), CLCA3 (GenBank accession No. NM_004921), and CLCA4
(GenBank accession No. NM_012128) genes. In yet another preferred embodiment,
the invention features the inhibition of the CLCA1 gene using the nucleic acid-based
techniques of the instant invention.

In another preferred embodiment, the invention features the use of an
enzymatic nucleic acid molecule, preferably in the hammerhead, NCH (Inozyme),
G-cleaver, amberzyme, zinzyme and/or DNAzyme motif, to inhibit the expression of
CLCA genes.

By "inhibit" it is meant that the activity of CLCA1 or level of RNAs or
equivalent RNAs encoding one or more protein subunits of CLCA1 is reduced
below that observed in the absence of the nucleic acid molecules of the invention. In
one embodiment, inhibition with enzymatic nucleic acid molecules preferably is
below that level observed in the presence of an enzymatically inactive or attenuated
molecule that is able to bind to the same site on the target RNA, but is unable to
cleave that RNA. In another embodiment, inhibition with antisense oligonucleotides
is preferably below that level observed in the presence of, for example, an
oligonucleotide with scrambled sequence or with mismatches. In another
embodiment, inhibition of CLCA1 genes with the nucleic acid molecule of the
instant invention is greater in the presence of the nucleic acid molecule than in
its absence, or in the presence of a control, irrelevant, or non-inhibitory
oligonucleotide.



By "enzymatic nucleic acid molecule" it is meant a nucleic acid molecule
which has complementarity in a substrate binding region to a specified gene target,
and also has an enzymatic activity which is active to specifically cleave target RNA.
That is, the enzymatic nucleic acid molecule is able to intermolecularly cleave RNA
and thereby inactivate a target RNA molecule. These complementary regions allow
sufficient hybridization of the enzymatic nucleic acid molecule to the target RNA
and thus permit cleavage. One hundred percent complementarity is preferred, but
complementarity as low as 50-75% may also be useful in this invention. The nucleic
acids may be modified at the base, sugar, and/or phosphate groups. The term
enzymatic nucleic acid is used interchangeably with phrases such as ribozymes,
catalytic RNA, enzymatic RNA, catalytic DNA, aptazyme or aptamer-binding
ribozyme, regulatable ribozyme, catalytic oligonucleotides, nucleozyme, DNAzyme,
RNA enzyme, endoribonuclease, endonuclease, minizyme, leadzyme, oligozyme or
DNA enzyme. All of these terminologies describe nucleic acid molecules with
enzymatic activity. The specific enzymatic nucleic acid molecules described in the
instant application are not meant to be limiting and those skilled in the art will
recognize that all that is important in an enzymatic nucleic acid molecule of this
invention is that it have a specific substrate binding site which is complementary to
one or more of the target nucleic acid regions, and that it have nucleotide sequences
within or surrounding that substrate binding site which impart a nucleic acid
cleaving activity to the molecule (Cech et al., U.S. Patent No. 4,987,071; Cech et al.,
1988, JAMA).

By "nucleic acid molecule" as used herein is meant a molecule having 
nucleotides. The nucleic acid can be single, double, or multiple stranded and may 
comprise modified or unmodified nucleotides or non-nucleotides or various mixtures 
and combinations thereof. 

By "enzymatic portion" or "catalytic domain" is meant that portion/region of
the enzymatic nucleic acid molecule essential for cleavage of a nucleic acid substrate
(for example, see Figures 1-4).

By "substrate binding arm" or "substrate binding domain" is meant that
portion/region of a ribozyme which is complementary to (i.e., able to base-pair with)
a portion of its substrate. Generally, such complementarity is 100%, but can be less
if desired. For example, as few as 10 bases out of 14 may be base-paired. Examples
of such arms are shown generally in Figures 1-4. That is, these arms contain
sequences within a ribozyme which are intended to bring ribozyme and target RNA
together through complementary base-pairing interactions. The ribozyme of the
invention may have binding arms that are contiguous or non-contiguous and may be
of varying lengths. The length of the binding arm(s) is preferably greater than or
equal to four nucleotides and of sufficient length to stably interact with the target
RNA; specifically 12-100 nucleotides; more specifically 14-24 nucleotides long. If
two binding arms are chosen, the design is such that the lengths of the binding arms
are symmetrical (i.e., each of the binding arms is of the same length; e.g., five and
five nucleotides, six and six nucleotides or seven and seven nucleotides long) or
asymmetrical (i.e., the binding arms are of different length; e.g., six and three
nucleotides; three and six nucleotides long; four and five nucleotides long; four and
six nucleotides long; four and seven nucleotides long; and the like).
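The symmetrical/asymmetrical distinction above can be illustrated with a minimal Python sketch. The function name and the example arm sequences are hypothetical and chosen only for illustration; they are not sequences from the Tables.

```python
def describe_binding_arms(arm1: str, arm2: str) -> str:
    """Label a pair of binding arms by relative length.

    Arms of equal length are "symmetrical"; arms of different length are
    "asymmetrical". The sequences themselves are not inspected.
    """
    n1, n2 = len(arm1), len(arm2)
    kind = "symmetrical" if n1 == n2 else "asymmetrical"
    return f"{kind} ({n1} and {n2} nucleotides)"


# Hypothetical arm sequences, used only to exercise the function.
print(describe_binding_arms("GGAUC", "CUAGG"))  # five and five
print(describe_binding_arms("GGAUCA", "CUA"))   # six and three
```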

By "NCH" or "Inozyme" motif is meant an enzymatic nucleic acid molecule
comprising a motif as described in Ludwig et al., USSN No. 09/406,643, filed
September 27, 1999, entitled "COMPOSITIONS HAVING RNA CLEAVING
ACTIVITY", and International PCT publication Nos. WO 98/58058 and WO
98/58057, all incorporated by reference herein in their entirety including the
drawings.

By "G-cleaver" motif is meant an enzymatic nucleic acid molecule
comprising a motif as described in Eckstein et al., International PCT publication No.
WO 99/16871, incorporated by reference herein in its entirety including the
drawings.

By "zinzyme" motif is meant a class II enzymatic nucleic acid molecule
comprising a motif as described in Beigelman et al., International PCT publication
No. WO 99/55857, incorporated by reference herein in its entirety including the
drawings. Zinzymes represent a non-limiting example of an enzymatic nucleic acid
molecule that does not require a ribonucleotide (2'-OH) group within its own nucleic
acid sequence for activity.

By "amberzyme" motif is meant a class I enzymatic nucleic acid molecule
comprising a motif as described in Beigelman et al., International PCT publication
No. WO 99/55857, incorporated by reference herein in its entirety including the
drawings. Amberzymes represent a non-limiting example of an enzymatic nucleic
acid molecule that does not require a ribonucleotide (2'-OH) group within its own
nucleic acid sequence for activity.

By "DNAzyme" is meant an enzymatic nucleic acid molecule that does not
require the presence of a ribonucleotide (2'-OH) group within the DNAzyme
molecule for its activity. In particular embodiments the enzymatic nucleic acid
molecule may have an attached linker(s) or other attached or associated groups,
moieties, or chains containing one or more nucleotides with 2'-OH groups.
DNAzymes can be synthesized chemically or expressed endogenously in vivo, by
means of a single stranded DNA vector or equivalent thereof.

By "sufficient length" is meant an oligonucleotide of greater than or equal to 3
nucleotides that is of a length great enough to provide the intended function under
the expected condition. For example, for binding arms of an enzymatic nucleic acid,
"sufficient length" means that the binding arm sequence is long enough to provide
stable binding to a target site under the expected binding conditions. Preferably, the
binding arms are not so long as to prevent useful turnover.

By "stably interact" is meant, interaction of the oligonucleotides with target 
nucleic acid (e.g., by forming hydrogen bonds with complementary nucleotides in 
the target under physiological conditions). 

By "equivalent" RNA to CLCA1 is meant to include those naturally occurring 
RNA molecules having homology (partial or complete) to CLCA1 proteins or 
encoding for proteins with similar function as CLCA1 in various organisms, 
including human, rodent, primate, rabbit, pig, protozoans, fungi, plants, and other 
microorganisms and parasites. The equivalent RNA sequence also includes, in
addition to the coding region, regions such as the 5'-untranslated region,
3'-untranslated region, introns, intron-exon junction and the like.

By "homology" is meant the nucleotide sequence of two or more nucleic acid 
molecules is partially or completely identical. 

By "antisense nucleic acid", it is meant a non-enzymatic nucleic acid
molecule that binds to target RNA by means of RNA-RNA or RNA-DNA or RNA-PNA
(peptide nucleic acid; Egholm et al., 1993, Nature 365, 566) interactions and
alters the activity of the target RNA (for a review, see Stein and Cheng, 1993,
Science 261, 1004 and Woolf et al., U.S. Patent No. 5,849,902). Typically, antisense
molecules will be complementary to a target sequence along a single contiguous
sequence of the antisense molecule. However, in certain embodiments, an antisense
molecule may bind to substrate such that the substrate molecule forms a loop, and/or
an antisense molecule may bind such that the antisense molecule forms a loop.
Thus, the antisense molecule may be complementary to two (or even more)
non-contiguous substrate sequences or two (or even more) non-contiguous sequence
portions of an antisense molecule may be complementary to a target sequence or
both. For a review of current antisense strategies, see Schmajuk et al., 1999, J. Biol.
Chem., 274, 21783-21789; Delihas et al., 1997, Nature, 15, 751-753; Stein et al.,
1997, Antisense N. A. Drug Dev., 7, 151; Crooke, 1998, Biotech. Genet. Eng. Rev.,
15, 121-157; Crooke, 1997, Ad. Pharmacol., 40, 1-49. In addition, antisense DNA
can be used to target RNA by means of DNA-RNA interactions, thereby activating
RNase H, which digests the target RNA in the duplex. Antisense DNA can be
synthesized chemically or expressed via the use of a single stranded DNA expression
vector or equivalent thereof.
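As an illustration of an antisense molecule that is complementary to its target along a single contiguous sequence, the following minimal Python sketch derives an antisense DNA oligonucleotide as the reverse complement of an RNA target site. The function name and the six-nucleotide target are hypothetical; the sketch assumes Watson-Crick pairing only and does not model loops, non-contiguous binding, or chemical modifications.

```python
# Watson-Crick partner, as a DNA base, for each RNA base in the target.
RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_dna(target_rna: str) -> str:
    """Return the antisense DNA (written 5'->3') for an RNA target (5'->3').

    The two strands pair antiparallel, so the target is read from its
    3' end while the antisense strand is built from its 5' end.
    """
    target = target_rna.upper()
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(target))


# Hypothetical target site, not taken from any CLCA sequence.
print(antisense_dna("AUGGCU"))  # -> AGCCAT
```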

By "2-5A antisense chimera" it is meant an antisense oligonucleotide
containing a 5'-phosphorylated 2'-5'-linked adenylate residue. These chimeras bind
to target RNA in a sequence-specific manner and activate a cellular 2-5A-dependent
ribonuclease which, in turn, cleaves the target RNA (Torrence et al., 1993, Proc.
Natl. Acad. Sci. USA 90, 1300).

By "triplex DNA" it is meant an oligonucleotide that can bind to a
double-stranded DNA in a sequence-specific manner to form a triple-strand helix.
Formation of such triple helix structure has been shown to inhibit transcription of the
targeted gene (Duval-Valentin et al., 1992, Proc. Natl. Acad. Sci. USA 89, 504).

By "gene" it is meant a nucleic acid that encodes an RNA. 

By "complementarity" is meant that a nucleic acid can form hydrogen bond(s)
with another RNA sequence by either traditional Watson-Crick or other
non-traditional types. In reference to the nucleic acid molecules of the present
invention, the binding free energy for a nucleic acid molecule with its target or
complementary sequence is sufficient to allow the relevant function of the nucleic
acid to proceed, e.g., ribozyme cleavage, antisense or triple helix inhibition.
Determination of binding free energies for nucleic acid molecules is well known in
the art (see, e.g., Turner et al., 1987, CSH Symp. Quant. Biol. LII, pp. 123-133;
Freier et al., 1986, Proc. Natl. Acad. Sci. USA 83:9373-9377; Turner et al., 1987,
J. Am. Chem. Soc. 109:3783-3785). A percent complementarity indicates the
percentage of contiguous residues in a nucleic acid molecule which can form
hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid
sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and
100% complementary). "Perfectly complementary" means that all the contiguous
residues of a nucleic acid sequence will hydrogen bond with the same number of
contiguous residues in a second nucleic acid sequence.
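The percent complementarity arithmetic above (e.g., 8 out of 10 being 80%) can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function name and the example sequences are hypothetical, both strands are written 5' to 3' and have equal length, and only Watson-Crick pairs are counted, whereas the definition above also admits non-traditional pairing.

```python
# Base pairs counted as hydrogen-bonded in this sketch (Watson-Crick only).
WATSON_CRICK = {
    ("A", "U"), ("U", "A"), ("A", "T"), ("T", "A"),
    ("G", "C"), ("C", "G"),
}


def percent_complementarity(strand: str, target: str) -> float:
    """Percentage of residues in `strand` that pair with `target`.

    Both sequences are given 5'->3'. They are aligned antiparallel, so the
    first residue of `strand` faces the last residue of `target`.
    """
    if not strand or len(strand) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    facing = zip(strand.upper(), reversed(target.upper()))
    paired = sum((a, b) in WATSON_CRICK for a, b in facing)
    return 100.0 * paired / len(strand)


# Hypothetical 10-nucleotide target and two candidate binding sequences.
target = "AUGGCUAACG"
print(percent_complementarity("CGUUAGCCAU", target))  # all 10 residues pair
print(percent_complementarity("CGUUAGCCGG", target))  # last two residues mismatch
```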

At least seven basic varieties of naturally occurring enzymatic nucleic acids
are known presently. Each can catalyze the hydrolysis of RNA phosphodiester
bonds in trans (and thus can cleave other RNA molecules) under physiological
conditions. Table I summarizes some of the characteristics of these ribozymes. In
general, enzymatic nucleic acids act by first binding to a target RNA. Such binding
occurs through the target binding portion of an enzymatic nucleic acid which is held
in close proximity to an enzymatic portion of the molecule that acts to cleave the
target RNA. Thus, the enzymatic nucleic acid first recognizes and then binds a
target RNA through complementary base-pairing, and once bound to the correct site,
acts enzymatically to cut the target RNA. Strategic cleavage of such a target RNA
will destroy its ability to direct synthesis of an encoded protein. After an enzymatic
nucleic acid has bound and cleaved its RNA target, it is released from that RNA to
search for another target and can repeatedly bind and cleave new targets. Thus, a
single ribozyme molecule is able to cleave many molecules of target RNA. In
addition, the ribozyme is a highly specific inhibitor of gene expression, with the
specificity of inhibition depending not only on the base-pairing mechanism of
binding to the target RNA, but also on the mechanism of target RNA cleavage.
Single mismatches, or base-substitutions, near the site of cleavage can completely
eliminate catalytic activity of a ribozyme.

The enzymatic nucleic acid molecules that cleave the specified sites in
CLCA1-specific RNAs represent a novel therapeutic approach to treat Chronic
Obstructive Pulmonary Diseases (COPDs), chronic bronchitis, asthma, cystic fibrosis,
obstructive bowel syndrome, and other indications that may respond to the level of
CLCA1.

In one of the preferred embodiments of the inventions described herein, the
enzymatic nucleic acid molecule is formed in a hammerhead or hairpin motif, but
may also be formed in the motif of a hepatitis delta virus, group I intron, group II
intron or RNase P RNA (in association with an RNA guide sequence), Neurospora
VS RNA, DNAzymes, NCH cleaving motifs, or G-cleavers. Examples of such
hammerhead motifs are described by Dreyfus, supra, and Rossi et al., 1992, AIDS
Research and Human Retroviruses 8, 183. Examples of hairpin motifs are described
by Hampel et al., EP0360257; Hampel and Tritz, 1989, Biochemistry 28, 4929;
Feldstein et al., 1989, Gene 82, 53; Haseloff and Gerlach, 1989, Gene 82, 43;
Hampel et al., 1990, Nucleic Acids Res. 18, 299; and Chowrira & McSwiggen, U.S.
Patent No. 5,631,359. The hepatitis delta virus motif is described by Perrotta and
Been, 1992, Biochemistry 31, 16. The RNase P motif is described by
Guerrier-Takada et al., 1983, Cell 35, 849; Forster and Altman, 1990, Science 249,
783; and Li and Altman, 1996, Nucleic Acids Res. 24, 835. The Neurospora VS RNA
ribozyme motif is described by Collins (Saville and Collins, 1990, Cell 61, 685-696;
Saville and Collins, 1991, Proc. Natl. Acad. Sci. USA 88, 8826-8830; Collins and
Olive, 1993, Biochemistry 32, 2795-2799; Guo and Collins, 1995, EMBO J. 14, 363).
Group II introns are described by Griffin et al., 1995, Chem. Biol. 2, 761; Michels
and Pyle, 1995, Biochemistry 34, 2965; and Pyle et al., International PCT Publication
No. WO 96/22689. The Group I intron is described by Cech et al., U.S. Patent
4,987,071. DNAzymes are described by Usman et al., International PCT Publication
No. WO 95/11304; Chartrand et al., 1995, NAR 23, 4092; Breaker et al., 1995, Chem.
Bio. 2, 655; and Santoro et al., 1997, PNAS 94, 4262. NCH cleaving motifs are
described in Ludwig & Sproat, International PCT Publication No. WO 98/58058; and
G-cleavers are described in Kore et al., 1998, Nucleic Acids Research 26, 4116-4120
and Eckstein et al., International PCT Publication No. WO 99/16871. Additional
motifs such as the Aptazyme (Breaker et al., WO 98/43993), Amberzyme (Class I
motif; Figure 3; Beigelman et al., International PCT publication No. WO 99/55857)
and Zinzyme (Beigelman et al., International PCT publication No. WO 99/55857)
can also be used in the present invention. All of these references are incorporated
by reference herein in their totalities, including drawings. These specific motifs are
not limiting in the invention, and those skilled in the art will recognize that all that is
important in an enzymatic nucleic acid molecule of this invention is that it has a
specific substrate binding site which is complementary to one or more of the target
gene RNA regions, and that it have nucleotide sequences within or surrounding that
substrate binding site which impart an RNA cleaving activity to the molecule (Cech
et al., U.S. Patent No. 4,987,071).

In preferred embodiments of the present invention, a nucleic acid molecule,
e.g., an antisense molecule, a triplex DNA, or a ribozyme, is 13 to 100 nucleotides in
length, e.g., in specific embodiments 35, 36, 37, or 38 nucleotides in length (e.g., for
particular ribozymes or antisense). In particular embodiments, the nucleic acid
molecule is 15-100, 17-100, 20-100, 21-100, 23-100, 25-100, 27-100, 30-100,
32-100, 35-100, 40-100, 50-100, 60-100, 70-100, or 80-100 nucleotides in length.
Instead of 100 nucleotides being the upper limit on the length ranges specified
above, the upper limit of the length range can be, for example, 30, 40, 50, 60, 70, or
80 nucleotides. Thus, for any of the length ranges, the length range for particular
embodiments has a lower limit as specified, with an upper limit as specified which is
greater than the lower limit. For example, in a particular embodiment, the length
range can be 35-50 nucleotides in length. All such ranges are expressly included.
Also in particular embodiments, a nucleic acid molecule can have a length which is
any of the lengths specified above, for example, 21 nucleotides in length.
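The rule for combining lower and upper limits can be sketched in Python as below. The limit values are those listed in the preceding paragraph; the function and variable names are hypothetical, and the sketch simply pairs each listed lower limit with each listed upper limit that exceeds it.

```python
# Lower limits from the ranges "15-100" through "80-100" in the text.
LOWER_LIMITS = [15, 17, 20, 21, 23, 25, 27, 30, 32, 35, 40, 50, 60, 70, 80]
# Upper limits named in the text: 100, or alternatively 30 through 80.
UPPER_LIMITS = [30, 40, 50, 60, 70, 80, 100]


def allowed_length_ranges(lowers=LOWER_LIMITS, uppers=UPPER_LIMITS):
    """Return every (lower, upper) pair whose upper limit exceeds its lower limit."""
    return [(lo, hi) for lo in lowers for hi in uppers if hi > lo]


ranges = allowed_length_ranges()
print((35, 50) in ranges)  # the 35-50 nucleotide example from the text
print((50, 30) in ranges)  # excluded: upper limit is not greater than lower
```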

In a preferred embodiment, the invention provides a method for producing a
class of nucleic acid-based gene inhibiting agents which exhibit a high degree of
specificity for the RNA of a desired target. For example, the enzymatic nucleic acid
molecule is preferably targeted to a highly conserved sequence region of target
RNAs encoding CLCA proteins (for example, CLCA1, CLCA2, CLCA3 and/or
CLCA4) such that specific treatment of a disease or condition can be provided with
either one or several nucleic acid molecules of the invention. Such nucleic acid
molecules can be delivered exogenously to specific tissue or cellular targets as
required. Alternatively, the nucleic acid molecules (e.g., ribozymes and antisense)
can be expressed from DNA and/or RNA vectors that are delivered to specific cells.

In a preferred embodiment, the invention features the use of nucleic acid-based
inhibitors of the invention to specifically target genes that share homology with the
CLCA1 gene.

As used herein "cell" is used in its usual biological sense, and does not refer to
an entire multicellular organism, e.g., specifically does not refer to a human. The
cell may be present in a non-human multicellular organism, e.g., birds, plants and
mammals such as cows, sheep, apes, monkeys, swine, dogs, and cats.

By "CLCA proteins" is meant a protein or a mutant protein derivative thereof,
comprising a calcium activated chloride channel protein.

By "highly conserved sequence region" is meant a nucleotide sequence of one
or more regions in a target gene that does not vary significantly from one generation
to the other or from one biological system to the other.

The nucleic acid-based inhibitors of CLCA1 expression are useful for the
prevention and/or treatment of diseases and conditions including Chronic
Obstructive Pulmonary Disease (COPD), chronic bronchitis, asthma, cystic fibrosis,
obstructive bowel syndrome, and any other diseases or conditions that are related to
or will respond to the levels of CLCA1 in a cell or tissue, alone or in combination
with other therapies.

By "related" is meant that the reduction of CLCA1 expression (specifically 
CLCA1 gene) RNA levels and thus reduction in the level of the respective protein 
will relieve, to some extent, the symptoms of the disease or condition. 

The nucleic acid-based inhibitors of the invention are added directly, or can be
complexed with cationic lipids, packaged within liposomes, or otherwise delivered
to target cells or tissues. The nucleic acid or nucleic acid complexes can be locally
administered to relevant tissues ex vivo, or in vivo through injection, infusion pump
or stent, with or without their incorporation in biopolymers. In preferred
embodiments, the enzymatic nucleic acid inhibitors comprise sequences that are
complementary to the substrate sequences in Tables III to IX. Examples of such
enzymatic nucleic acid molecules also are shown in Tables III to IX. Examples of
such enzymatic nucleic acid molecules consist essentially of sequences defined in
these Tables.

In yet another embodiment, the invention features antisense nucleic acid 
molecules and 2-5A chimera including sequences complementary to the substrate 
sequences shown in Tables III to IX. Such nucleic acid molecules can include 
sequences as shown for the binding arms of the enzymatic nucleic acid molecules in 
Tables III to VIII and sequences shown as GeneBloc™ sequences in Table IX. 
Similarly, triplex molecules can be provided targeted to the corresponding DNA 
target regions, and containing the DNA equivalent of a target sequence or a sequence 
complementary to the specified target (substrate) sequence. Typically, antisense 
molecules will be complementary to a target sequence along a single contiguous 
sequence of the antisense molecule. However, in certain embodiments, an antisense 
molecule may bind to substrate such that the substrate molecule forms a loop, and/or 
an antisense molecule may bind such that the antisense molecule forms a loop. 
Thus, the antisense molecule may be complementary to two (or even more) non- 
contiguous substrate sequences or two (or even more) non-contiguous sequence 
portions of an antisense molecule may be complementary to a target sequence or 
both. 

By "consists essentially of" is meant that the active nucleic acid molecule of
the invention, for example, an enzymatic nucleic acid molecule, contains an
enzymatic center or core equivalent to those in the examples, and binding arms able
to bind RNA such that cleavage at the target site occurs. Other sequences can be
present which do not interfere with such cleavage. Thus, a core region can, for
example, include one or more loop, stem-loop structure, or linker which does not
prevent enzymatic activity. Thus, the underlined regions in the sequences in Tables
III, IV and VIII can be such a loop, stem-loop, nucleotide linker, and/or
non-nucleotide linker and can be represented generally as sequence "X". For example, a
core sequence for a hammerhead enzymatic nucleic acid can comprise a conserved
sequence, such as 5'-CUGAUGAG-3' and 5'-CGAA-3' connected by "X", where X
is 5'-GCCGUUAGGC-3' (SEQ ID NO 5450), or any other Stem II region known in
the art.
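The core arrangement described above, two conserved segments joined by "X", can be sketched in Python as follows. The three sequence constants are the ones quoted in the preceding paragraph; the function names, the example binding arms, and the assumption that the binding arms simply flank the core on either side are illustrative only and are not taken from the Tables.

```python
CORE_5P = "CUGAUGAG"      # conserved segment 5'-CUGAUGAG-3' quoted in the text
CORE_3P = "CGAA"          # conserved segment 5'-CGAA-3' quoted in the text
DEFAULT_X = "GCCGUUAGGC"  # example "X" (SEQ ID NO 5450) quoted in the text


def hammerhead_core(x: str = DEFAULT_X) -> str:
    """Join the two conserved segments with the linker or stem-loop region X."""
    return CORE_5P + x + CORE_3P


def assemble_ribozyme(arm_5p: str, arm_3p: str, x: str = DEFAULT_X) -> str:
    """Place binding arms on either side of the core (assumed layout, 5'->3')."""
    return arm_5p + hammerhead_core(x) + arm_3p


print(hammerhead_core())  # -> CUGAUGAGGCCGUUAGGCCGAA
# Hypothetical six-nucleotide arms, used only to show the assembly.
print(assemble_ribozyme("GGAUCC", "ACUUGA"))
```

Keeping X as a parameter mirrors the statement that X can be any loop, stem-loop, or linker that does not prevent enzymatic activity.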






In another aspect of the invention, ribozymes or antisense molecules that 
interact with target RNA molecules and inhibit CLCA1 (specifically CLCA1 gene) 
activity are expressed from transcription units inserted into DNA or RNA vectors. 
The recombinant vectors are preferably DNA plasmids or viral vectors. Ribozyme or 
antisense expressing viral vectors could be constructed based on, but not limited to, 
adeno-associated virus, retrovirus, adenovirus, or alphavirus. Preferably, the 
recombinant vectors capable of expressing the ribozymes or antisense are delivered 
as described above, and persist in target cells. Alternatively, viral vectors may be 
used that provide for transient expression of ribozymes or antisense. Such vectors 
can be repeatedly administered as necessary. Once expressed, the ribozymes or 
antisense bind to the target RNA and inhibit its function or expression. Delivery of 
ribozyme or antisense expressing vectors can be systemic, such as by intravenous or 
intramuscular administration, by administration to target cells explanted from the
patient followed by reintroduction into the patient, or by any other means that would 
allow for introduction into the desired target cell. Antisense DNA can be expressed 
endogenously via the use of a single stranded DNA intracellular expression vector. 

By "RNA" is meant a molecule comprising at least one ribonucleotide residue.
By "ribonucleotide" is meant a nucleotide with a hydroxyl group at the 2' position of
a β-D-ribo-furanose moiety.

By "vectors" is meant any nucleic acid- and/or viral-based technique used to 
deliver a desired nucleic acid. 

By "patient" is meant an organism, which is a donor or recipient of explanted 
cells or the cells themselves. "Patient" also refers to an organism to which the 
nucleic acid molecules of the invention can be administered. Preferably, a patient is 
a mammal or mammalian cells. More preferably, a patient is a human or human 
cells. 

The nucleic acid molecules of the instant invention, individually, or in 
combination or in conjunction with other drugs, can be used to treat diseases or 
conditions discussed above. For example, to treat a disease or condition associated 
with the levels of CLCA1, the patient may be treated, or other appropriate cells may 
be treated, as is evident to those skilled in the art, individually or in combination 
with one or more drugs under conditions suitable for the treatment. 

In a further embodiment, the described molecules, such as antisense or 
ribozymes, can be used in combination with other known treatments to treat 
conditions or diseases discussed above. For example, the described molecules could 
be used in combination with one or more known therapeutic agents to treat Chronic 
Obstructive Pulmonary Diseases (COPDs), chronic bronchitis, asthma, cystic 
fibrosis, obstructive bowel syndrome, and/or other disease states or conditions which 
respond to the modulation of CLCA1 expression. 

In another preferred embodiment, the invention features nucleic acid-based 
inhibitors (e.g., enzymatic nucleic acid molecules (ribozymes), antisense nucleic 
acids, 2-5A antisense chimeras, triplex DNA, antisense nucleic acids containing 
RNA cleaving chemical groups) and methods for their use to down regulate or 
inhibit the expression of genes (e.g., CLCA1) capable of progression and/or 
maintenance of Chronic Obstructive Pulmonary Diseases (COPDs), chronic 
bronchitis, asthma, cystic fibrosis, obstructive bowel syndrome, and/or other disease 
states or conditions which respond to the modulation of CLCA1 expression. 

By "comprising" is meant including, but not limited to, whatever follows the 
word "comprising". Thus, use of the term "comprising" indicates that the listed 
elements are required or mandatory, but that other elements are optional and may or 
may not be present. By "consisting of" is meant including, and limited to, whatever 
follows the phrase "consisting of". Thus, the phrase "consisting of" indicates that the 
listed elements are required or mandatory, and that no other elements may be 
present. By "consisting essentially of" is meant including any elements listed after 
the phrase, and limited to other elements that do not interfere with or contribute to 
the activity or action specified in the disclosure for the listed elements. Thus, the 
phrase "consisting essentially of" indicates that the listed elements are required or 
mandatory, but that other elements are optional and may or may not be present 
depending upon whether or not they affect the activity or action of the listed 
elements. 

The foregoing description of the various aspects and embodiments is 
provided with reference to the exemplary calcium activated chloride channel gene 
CLCA1, which is also referred to as CaCC1 or ICACC-1. However, the various 
aspects and embodiments are also directed to other genes which express CLCA1 or 
CaCC1-like proteins (for example hCLCA2, hCLCA3, hCLCA4, CaCC2, and 
CaCC3). Those additional genes can be analyzed for target sites using the methods 
described for CLCA1. Thus, the inhibition and the effects of such inhibition of the 
other genes can be performed as described herein. 

Other features and advantages of the invention will be apparent from the 
following description of the preferred embodiments thereof, and from the claims. 

Description Of The Preferred Embodiments 

First the drawings will be described briefly. 
Drawings 

Figure 1 shows examples of chemically stabilized ribozyme motifs. HH Rz 
represents the hammerhead ribozyme motif (Usman et al., 1996, Curr. Op. Struct. Bio., 
1, 527); NCH Rz represents the NCH ribozyme motif (Ludwig & Sproat, 
International PCT Publication No. WO 98/58058); G-Cleaver represents the G-cleaver 
ribozyme motif (Kore et al., 1998, Nucleic Acids Research 26, 4116-4120). N or n 
represent independently a nucleotide which may be the same or different and have 
complementarity to each other; rI represents ribo-Inosine nucleotide; the arrow 
indicates the site of cleavage within the target. Position 4 of the HH Rz and the NCH 
Rz is shown as having a 2'-C-allyl modification, but those skilled in the art will 
recognize that this position can be modified with other modifications well known in 
the art, so long as such modifications do not significantly inhibit the activity of the 
ribozyme. 

Figure 2 shows an example of the Amberzyme ribozyme motif that is 
chemically stabilized (see, for example, Beigelman et al., International PCT 
publication No. WO 99/55857, incorporated by reference herein; also referred to as 
Class I Motif). The Amberzyme motif is a class of enzymatic nucleic acid molecules that 
do not require the presence of a ribonucleotide (2'-OH) group for activity. 

Figure 3 shows an example of the Zinzyme A ribozyme motif that is 
chemically stabilized (Beigelman et al., International PCT publication No. WO 
99/55857, incorporated by reference herein; also referred to as Class A or Class II 
Motif). The Zinzyme motif is a class of enzymatic nucleic acid molecules that do not 
require the presence of a ribonucleotide (2'-OH) group for activity. 

Figure 4 shows an example of a DNAzyme motif described by Santoro et al., 
1997, PNAS, 94, 4262. 

Figures 5A and 5B are diagrammatic schemes representative of the process 
used for Target Discovery in the instant invention. The process for Target Discovery 
is described in Jarvis et al., International PCT publication No. WO 98/50530, 
incorporated by reference herein in its entirety including the Figures. 



Mechanism of action of Nucleic Acid Molecules of the Invention 

Antisense : Antisense molecules may be modified or unmodified RNA, DNA, 
or mixed polymer oligonucleotides which primarily function by specifically binding 
to matching sequences resulting in inhibition of peptide synthesis (Wu-Pong, Nov 
1994, BioPharm, 20-33). The antisense oligonucleotide binds to target RNA by 
Watson Crick base-pairing and blocks gene expression by preventing ribosomal 
translation of the bound sequences either by steric blocking or by activating RNase 
H enzyme. Antisense molecules can also alter protein synthesis by interfering with 
RNA processing or transport from the nucleus into the cytoplasm (Mukhopadhyay & 
Roth, 1996, Crit. Rev. in Oncogenesis 7, 151-190). 

In addition, binding of single stranded DNA to RNA may result in nuclease 
degradation of the heteroduplex (Wu-Pong, supra; Crooke, supra). To date, the only 
backbone-modified DNA chemistries known to act as substrates for RNase H are 
phosphorothioates, phosphorodithioates, and borontrifluoridates. Recently it has 
been reported that 2'-arabino and 2'-fluoro arabino-containing oligos can also 
activate RNase H. 

A number of antisense molecules have been described that utilize novel 
configurations of chemically modified nucleotides, secondary structure, and/or 
RNase H substrate domains (Woolf et al., International PCT Publication No. WO 
98/13526; Thompson et al., International PCT Publication No. WO 99/54459; 
Hartmann et al., USSN 60/101,174 which was filed on September 21, 1998); all of 
these are incorporated by reference herein in their entirety. 

In addition, antisense deoxyoligoribonucleotides can be used to target RNA by 
means of DNA-RNA interactions, thereby activating RNase H, which digests the 
target RNA in the duplex. Antisense DNA can be expressed endogenously in vivo 
via the use of a single stranded DNA intracellular expression vector or equivalents 
and variations thereof. 

Triplex Forming Oligonucleotides (TFO) : Single stranded DNA may be 
designed to bind to genomic DNA in a sequence specific manner. TFOs are 
comprised of pyrimidine-rich oligonucleotides which bind DNA helices through 
Hoogsteen Base-pairing (Wu-Pong, supra). The resulting triple helix composed of 
the DNA sense, DNA antisense, and TFO disrupts RNA synthesis by RNA 
polymerase. The TFO mechanism may result in gene expression or cell death since 
binding may be irreversible (Mukhopadhyay & Roth, supra). 

2-5A Antisense Chimera : The 2-5A system is an interferon mediated 
mechanism for RNA degradation found in higher vertebrates (Mitra et al., 1996, 
Proc Nat Acad Sci USA 93, 6780-6785). Two types of enzymes, 2-5A synthetase 
and RNase L, are required for RNA cleavage. The 2-5A synthetases require double 
stranded RNA to form 2'-5' oligoadenylates (2-5A). 2-5A then acts as an allosteric 
effector for utilizing RNase L which has the ability to cleave single stranded RNA. 
The ability to form 2-5A structures with double stranded RNA makes this system 
particularly useful for inhibition of viral replication. 

(2'-5') oligoadenylate structures may be covalently linked to antisense 
molecules to form chimeric oligonucleotides capable of RNA cleavage (Torrence, 
supra). These molecules putatively bind and activate a 2-5A dependent RNase; the 
oligonucleotide/enzyme complex then binds to a target RNA molecule, which can 
then be cleaved by the RNase enzyme. 

Enzymatic Nucleic Acid : Seven basic varieties of naturally occurring 
enzymatic RNAs are presently known. In addition, several in vitro selection 
(evolution) strategies (Orgel, 1979, Proc. R. Soc. London, B 205, 435) have been 
used to evolve new nucleic acid catalysts capable of catalyzing cleavage and ligation 
of phosphodiester linkages (Joyce, 1989, Gene, 82, 83-87; Beaudry et al, 1992, 
Science 257, 635-641; Joyce, 1992, Scientific American 267, 90-97; Breaker et al., 
1994, TIBTECH 12, 268; Bartel et al., 1993, Science 261:1411-1418; Szostak, 1993, 
TIBS 17, 89-93; Kumar et al, 1995, FASEB J., 9, 1183; Breaker, 1996, Curr. Op. 
Biotech., 7, 442; Santoro et al, 1997, Proc. Natl Acad. Sci., 94, 4262; Tang et al, 
1997, RNA 3, 914; Nakamaye & Eckstein, 1994, supra; Long & Uhlenbeck, 1994, 
supra; Ishizaka et al, 1995, supra; Vaish et al, 1997, Biochemistry 36, 6495; all of 
these are incorporated by reference herein). Each can catalyze a series of reactions 
including the hydrolysis of phosphodiester bonds in trans (and thus can cleave other 
RNA molecules) under physiological conditions. 

Nucleic acid molecules of this invention will block to some extent CLCA1 
protein expression and can be used to treat disease or diagnose disease associated 
with the levels of CLCA1. 

The enzymatic nature of a ribozyme has significant advantages; for example, the 
concentration of ribozyme necessary to effect a therapeutic treatment is lower. This 
advantage reflects the ability of the ribozyme to act enzymatically. Thus, a single 
ribozyme molecule is able to cleave many molecules of target RNA. In addition, the 
ribozyme is a highly specific inhibitor, with the specificity of inhibition depending 
not only on the base-pairing mechanism of binding to the target RNA, but also on 
the mechanism of target RNA cleavage. Single mismatches, or base-substitutions, 
near the site of cleavage can be chosen to completely eliminate catalytic activity of a 
ribozyme. 

Nucleic acid molecules having an endonuclease enzymatic activity are able to 
repeatedly cleave other separate RNA molecules in a nucleotide base sequence- 
specific manner. Such enzymatic nucleic acid molecules can be targeted to virtually 
any RNA transcript, and achieve efficient cleavage in vitro (Zaug et al., 324, Nature 
429, 1986; Uhlenbeck, 1987 Nature 328, 596; Kim et al., 84 Proc. Natl. Acad. Sci. 
USA 8788, 1987; Dreyfus, 1988, Einstein Quart. J. Bio. Med., 6, 92; Haseloff and 
Gerlach, 334 Nature 585, 1988; Cech, 260 JAMA 3030, 1988; and Jefferies et al., 17 
Nucleic Acids Research 1371, 1989; Santoro et al., 1997 supra). 

Because of their sequence specificity, trans-cleaving ribozymes show promise 
as therapeutic agents for human disease (Usman and McSwiggen, 1995 Ann. Rep. 
Med. Chem. 30, 285-294; Christoffersen and Marr, 1995 J. Med. Chem. 38, 2023- 
2037). Ribozymes can be designed to cleave specific RNA targets within the 
background of cellular RNA. Such a cleavage event renders the RNA non- 
functional and abrogates protein expression from that RNA. In this manner, 
synthesis of a protein associated with a disease state can be selectively inhibited 
(Warashina et al., 1999, Chemistry and Biology, 6, 237-250). 

The nucleic acid molecules of the instant invention are also referred to as 
GeneBloc reagents, which are essentially nucleic acid molecules (e.g., ribozymes, 
antisense) capable of down-regulating gene expression. 

GeneBlocs are modified oligonucleotides including ribozymes and modified 
antisense oligonucleotides that bind to and target specific mRNA molecules. 
Because GeneBlocs can be designed to target any specific mRNA, their potential 
applications are quite broad. Traditional antisense approaches have often relied 
heavily on the use of phosphorothioate modifications to enhance stability in 
biological samples, leading to a myriad of specificity problems stemming from non- 
specific protein binding and general cytotoxicity (Stein, 1995, Nature Medicine, 1, 
1119). In contrast, GeneBlocs contain a number of modifications that confer 
nuclease resistance while making minimal use of phosphorothioate linkages, which 
reduces toxicity, increases binding affinity and minimizes non-specific effects 
compared with traditional antisense oligonucleotides. Similar reagents have recently 
been utilized successfully in various cell culture systems (Vassar et al., 1999, 
Science, 286, 735) and in vivo (Jarvis et al., manuscript in preparation). In addition, 
novel cationic lipids can be utilized to enhance cellular uptake in the presence of 
serum. Since ribozymes and antisense oligonucleotides regulate gene expression at 
the RNA level, the ability to maintain a steady-state dose of GeneBloc over several 
days was important for target protein and phenotypic analysis. The advances in 
resistance to nuclease degradation and prolonged activity in vitro have supported the 
use of GeneBlocs in target validation applications. 

Target sites 

Targets for useful ribozymes and antisense nucleic acids can be determined 
as disclosed in Draper et al., WO 93/23569; Sullivan et al., WO 93/23057; 
Thompson et al., WO 94/02595; Draper et al., WO 95/04818; McSwiggen et al., US 
Patent No. 5,525,468. All of these publications are hereby incorporated by reference 
herein in their totality. Other examples include the following PCT applications, 
which concern inactivation of expression of disease-related genes: WO 95/23225, 
WO 95/13380, WO 94/02595, all of which are incorporated by reference herein. 
Rather than repeat the guidance provided in those documents here, specific examples 
of such methods are provided herein, not limiting to those in the art. Ribozymes and 
antisense to such targets are designed as described in those applications and 
synthesized to be tested in vitro and in vivo, as also described. The sequences of 
human CLCA1 RNAs were screened for optimal enzymatic nucleic acid and 
antisense target sites using a computer-folding algorithm. Antisense, hammerhead, 
DNAzyme, NCH, amberzyme, zinzyme, or G-Cleaver ribozyme binding/cleavage 
sites were identified. These sites are shown in Tables III to IX (all sequences are 5' 
to 3' in the tables; the underlined region can be any base-paired sequence, the actual 
sequence is not relevant here). The nucleotide base position is noted in the Tables as 
that site to be cleaved by the designated type of enzymatic nucleic acid molecule. 
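
As a purely illustrative sketch of how candidate cleavage sites might be enumerated before any folding analysis, the following Python fragment scans an RNA sequence for NUH triplets (H = A, C, or U), the motif generally described in the literature as cleavable by hammerhead ribozymes. The function name, the toy sequence, and the convention of reporting the 1-based position of the H nucleotide are assumptions made for this example; they are not taken from the Tables, and the screen actually used for CLCA1 additionally relied on a computer-folding algorithm that is not reproduced here.

```python
import re


def find_hammerhead_sites(rna):
    """List candidate hammerhead sites as (position, triplet) pairs.

    A site is any NUH triplet, where H is A, C, or U. The reported position
    is the 1-based index of the H nucleotide (an assumed convention for this
    sketch). DNA-style input is accepted: T is converted to U.
    """
    rna = rna.upper().replace("T", "U")
    sites = []
    # A zero-width lookahead is used so that overlapping triplets are all found.
    for match in re.finditer(r"(?=([ACGU]U[ACU]))", rna):
        triplet = match.group(1)
        position_of_h = match.start() + 3  # 0-based start + 2, then made 1-based
        sites.append((position_of_h, triplet))
    return sites


if __name__ == "__main__":
    toy_sequence = "GGAUCAGUCUUAGC"  # hypothetical sequence, not CLCA1
    for position, triplet in find_hammerhead_sites(toy_sequence):
        print(position, triplet)
```

A list produced this way is only a starting point; accessibility of each site within the folded target still has to be assessed separately.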

While human sequences can be screened and enzymatic nucleic acid molecule 
and/or antisense thereafter designed, as discussed in Stinchcomb et al., WO 
95/23225, mouse targeted ribozymes may be useful to test efficacy of action of the 
enzymatic nucleic acid molecule and/or antisense prior to testing in humans. 

Antisense, hammerhead, DNAzyme, NCH, amberzyme, zinzyme or G-Cleaver 
ribozyme binding/cleavage sites were identified. The nucleic acid molecules are 
individually analyzed by computer folding (Jaeger et al., 1989 Proc. Natl. Acad. Sci. 
USA, 86, 7706) to assess whether the sequences fold into the appropriate secondary 
structure. Those nucleic acid molecules with unfavorable intramolecular 
interactions such as between the binding arms and the catalytic core are eliminated 
from consideration. Varying binding arm lengths can be chosen to optimize activity. 






Antisense, hammerhead, DNAzyme, NCH, amberzyme, zinzyme or G-Cleaver 
ribozyme binding/cleavage sites were identified and were designed to anneal to 
various sites in the RNA target. The binding arms are complementary to the target 
site sequences described above. The nucleic acid molecules were chemically 
synthesized. The method of synthesis used follows the procedure for normal 
DNA/RNA synthesis as described below and in Usman et al., 1987 J. Am. Chem. 
Soc., 109, 7845; Scaringe et al., 1990 Nucleic Acids Res., 18, 5433; Wincott et al., 
1995 Nucleic Acids Res. 23, 2677-2684; and Caruthers et al., 1992, Methods in 
Enzymology 211, 3-19. 
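
The complementarity between binding arms and target site can be illustrated with a short Python sketch. It derives an antisense sequence as the reverse complement of a target, and derives two binding arms from the windows flanking a chosen cleavage position, leaving the nucleotide at that position unpaired. The function names, the default arm length, and the toy sequence are hypothetical, and the arm layout is a simplification assumed for this example rather than the design procedure of the cited references.

```python
COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}


def reverse_complement(rna):
    """Return the Watson-Crick reverse complement of an RNA sequence (5' to 3')."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))


def binding_arms(target, cleavage_pos, arm_len=7):
    """Return (five_prime_arm, three_prime_arm) for a trans-cleaving molecule.

    target       -- target RNA written 5' to 3'
    cleavage_pos -- 1-based position of the nucleotide left unpaired at the site
    arm_len      -- number of nucleotides in each arm (an assumed default)

    Because the strands pair antiparallel, the 5' arm is complementary to the
    target window 3' of the site, and the 3' arm to the window 5' of it.
    """
    index = cleavage_pos - 1
    if index - arm_len < 0 or index + 1 + arm_len > len(target):
        raise ValueError("binding arms would extend past the end of the target")
    upstream = target[index - arm_len:index]
    downstream = target[index + 1:index + 1 + arm_len]
    return reverse_complement(downstream), reverse_complement(upstream)


if __name__ == "__main__":
    toy_target = "GGAUCAGUCUUAGC"  # hypothetical sequence, not CLCA1
    print(reverse_complement(toy_target))
    print(binding_arms(toy_target, cleavage_pos=9, arm_len=4))
```

Calling `binding_arms` with several values of `arm_len` gives the set of arm-length variants that would then be compared for activity.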

Synthesis of Nucleic acid Molecules 

Synthesis of nucleic acids greater than 100 nucleotides in length is difficult 
using automated methods, and the therapeutic cost of such molecules is prohibitive. 
In this invention, small nucleic acid motifs ("small" refers to nucleic acid motifs no 
more than 100 nucleotides in length, preferably no more than 80 nucleotides in 
length, and most preferably no more than 50 nucleotides in length; e.g., antisense 
oligonucleotides, hammerhead or the NCH ribozymes) are preferably used for 
exogenous delivery. The simple structure of these molecules increases the ability of 
the nucleic acid to invade targeted regions of RNA structure. Exemplary molecules 
of the instant invention are chemically synthesized, and others can similarly be 
synthesized. 

Oligonucleotides (e.g., antisense GeneBlocs) are synthesized using protocols 
known in the art as described in Caruthers et al., 1992, Methods in Enzymology 211, 
3-19, Thompson et al., International PCT Publication No. WO 99/54459, Wincott et 
al., 1995, Nucleic Acids Res. 23, 2677-2684, Wincott et al., 1997, Methods Mol. 
Bio., 74, 59, Brennan et al., 1998, Biotechnol. Bioeng., 61, 33-45, and Brennan, US 
patent No. 6,001,311. All of these references are incorporated herein by reference. 
The synthesis of oligonucleotides makes use of common nucleic acid protecting and 
coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 
3'-end. In a non-limiting example, small scale syntheses are conducted on a 394 
Applied Biosystems, Inc. synthesizer using a 0.2 µmol scale protocol with a 2.5 min 
coupling step for 2'-O-methylated nucleotides and a 45 sec coupling step for 2'- 
deoxy nucleotides. Table II outlines the amounts and the contact times of the 
reagents used in the synthesis cycle. Alternatively, syntheses at the 0.2 µmol scale 
can be performed on a 96-well plate synthesizer, such as the instrument produced by 
Protogene (Palo Alto, CA) with minimal modification to the cycle. A 33-fold excess 
(60 µL of 0.11 M = 6.6 µmol) of 2'-O-methyl phosphoramidite and a 105-fold 
excess of S-ethyl tetrazole (60 µL of 0.25 M = 15 µmol) can be used in each 
coupling cycle of 2'-O-methyl residues relative to polymer-bound 5'-hydroxyl. A 
22-fold excess (40 µL of 0.11 M = 4.4 µmol) of deoxy phosphoramidite and a 70- 
fold excess of S-ethyl tetrazole (40 µL of 0.25 M = 10 µmol) can be used in each 
coupling cycle of deoxy residues relative to polymer-bound 5'-hydroxyl. Average 
coupling yields on the 394 Applied Biosystems, Inc. synthesizer, determined by 
colorimetric quantitation of the trityl fractions, are typically 97.5-99%. Other 
oligonucleotide synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer 
include the following: detritylation solution is 3% TCA in methylene chloride (ABI); 
capping is performed with 16% N-methyl imidazole in THF (ABI) and 10% acetic 
anhydride/10% 2,6-lutidine in THF (ABI); and oxidation solution is 16.9 mM I2, 49 
mM pyridine, 9% water in THF (PERSEPTIVE™). Burdick & Jackson Synthesis 
Grade acetonitrile is used directly from the reagent bottle. S-Ethyltetrazole solution 
(0.25 M in acetonitrile) is made up from the solid obtained from American 
International Chemical, Inc. Alternately, for the introduction of phosphorothioate 
linkages, Beaucage reagent (3H-1,2-Benzodithiol-3-one 1,1-dioxide, 0.05 M in 
acetonitrile) is used. 

Deprotection of the antisense oligonucleotides is performed as follows: the 
polymer-bound trityl-on oligoribonucleotide is transferred to a 4 mL glass screw top 
vial and suspended in a solution of 40% aq. methylamine (1 mL) at 65 °C for 10 
min. After cooling to -20 °C, the supernatant is removed from the polymer support. 
The support is washed three times with 1.0 mL of EtOH:MeCN:H2O/3:1:1, vortexed 
and the supernatant is then added to the first supernatant. The combined 
supernatants, containing the oligoribonucleotide, are dried to a white powder. 

The method of synthesis used for normal RNA including certain enzymatic 
nucleic acid molecules follows the procedure as described in Usman et al., 1987, J. 
Am. Chem. Soc., 109, 7845; Scaringe et al., 1990, Nucleic Acids Res., 18, 5433; 
Wincott et al., 1995, Nucleic Acids Res. 23, 2677-2684 and Wincott et al., 1997, 
Methods Mol. Bio., 74, 59, and makes use of common nucleic acid protecting and 
coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 
3'-end. In a non-limiting example, small scale syntheses are conducted on a 394 
Applied Biosystems, Inc. synthesizer using a 0.2 µmol scale protocol with a 7.5 min 
coupling step for alkylsilyl protected nucleotides and a 2.5 min coupling step for 2'- 
O-methylated nucleotides. Table II outlines the amounts and the contact times of 
the reagents used in the synthesis cycle. Alternatively, syntheses at the 0.2 µmol 
scale can be done on a 96-well plate synthesizer, such as the instrument produced by 
Protogene (Palo Alto, CA) with minimal modification to the cycle. A 33-fold excess 
(60 µL of 0.11 M = 6.6 µmol) of 2'-O-methyl phosphoramidite and a 75-fold excess 
of S-ethyl tetrazole (60 µL of 0.25 M = 15 µmol) can be used in each coupling cycle 
of 2'-O-methyl residues relative to polymer-bound 5'-hydroxyl. A 66-fold excess 
(120 µL of 0.11 M = 13.2 µmol) of alkylsilyl (ribo) protected phosphoramidite and a 
150-fold excess of S-ethyl tetrazole (120 µL of 0.25 M = 30 µmol) can be used in 
each coupling cycle of ribo residues relative to polymer-bound 5'-hydroxyl. 
Average coupling yields on the 394 Applied Biosystems, Inc. synthesizer, 
determined by colorimetric quantitation of the trityl fractions, are typically 97.5- 
99%. Other oligonucleotide synthesis reagents for the 394 Applied Biosystems, Inc. 
synthesizer include the following: detritylation solution is 3% TCA in methylene 
chloride (ABI); capping is performed with 16% N-methyl imidazole in THF (ABI) and 
10% acetic anhydride/10% 2,6-lutidine in THF (ABI); oxidation solution is 16.9 mM 
I2, 49 mM pyridine, 9% water in THF (PERSEPTIVE™). Burdick & Jackson Synthesis 
Grade acetonitrile is used directly from the reagent bottle. S-Ethyltetrazole solution 
(0.25 M in acetonitrile) is made up from the solid obtained from American 
International Chemical, Inc. Alternately, for the introduction of phosphorothioate 
linkages, Beaucage reagent (3H-1,2-Benzodithiol-3-one 1,1-dioxide, 0.05 M in 
acetonitrile) is used. 
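
The reagent excesses quoted for the RNA synthesis cycle follow from simple arithmetic: microlitres multiplied by molarity gives micromoles, and dividing by the 0.2 µmol synthesis scale gives the fold excess over polymer-bound 5'-hydroxyl. The Python sketch below reproduces that calculation for the four reagent deliveries described in the preceding paragraph. The function names are introduced here for illustration only, and the dictionary simply restates the volumes and concentrations given in the text.

```python
def micromoles(volume_ul, molarity):
    """Amount delivered in µmol (µL multiplied by mol/L gives µmol)."""
    return volume_ul * molarity


def fold_excess(volume_ul, molarity, scale_umol=0.2):
    """Molar excess of a delivered reagent over the support-bound 5'-hydroxyl."""
    return micromoles(volume_ul, molarity) / scale_umol


if __name__ == "__main__":
    # (volume in µL, concentration in M), restated from the RNA synthesis paragraph
    deliveries = {
        "2'-O-methyl phosphoramidite": (60, 0.11),
        "S-ethyl tetrazole, 2'-O-methyl cycle": (60, 0.25),
        "alkylsilyl (ribo) phosphoramidite": (120, 0.11),
        "S-ethyl tetrazole, ribo cycle": (120, 0.25),
    }
    for name, (volume_ul, molarity) in deliveries.items():
        amount = micromoles(volume_ul, molarity)
        excess = fold_excess(volume_ul, molarity)
        print(f"{name}: {amount:.1f} µmol, {excess:.0f}-fold excess")
```

The same two functions can be reused when the synthesis scale changes, by passing a different `scale_umol`.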

Deprotection of the RNA is performed using either a two-pot or one-pot 
protocol. For the two-pot protocol, the polymer-bound trityl-on oligoribonucleotide 
is transferred to a 4 mL glass screw top vial and suspended in a solution of 40% aq. 
methylamine (1 mL) at 65 °C for 10 min. After cooling to -20 °C, the supernatant is 
removed from the polymer support. The support is washed three times with 1.0 mL 
of EtOH:MeCN:H2O/3:1:1, vortexed and the supernatant is then added to the first 
supernatant. The combined supernatants, containing the oligoribonucleotide, are 
dried to a white powder. The base deprotected oligoribonucleotide is resuspended in 
anhydrous TEA/HF/NMP solution (300 µL of a solution of 1.5 mL N- 
methylpyrrolidinone, 750 µL TEA and 1 mL TEA·3HF to provide a 1.4 M HF 
concentration) and heated to 65 °C. After 1.5 h, the oligomer is quenched with 1.5 
M NH4HCO3. 

Alternatively, for the one-pot protocol, the polymer-bound trityl-on 
oligoribonucleotide is transferred to a 4 mL glass screw top vial and suspended in a 
solution of 33% ethanolic methylamine/DMSO: 1/1 (0.8 mL) at 65 °C for 15 min. 
The vial is brought to r.t. TEA·3HF (0.1 mL) is added and the vial is heated at 65 °C 
for 15 min. The sample is cooled at -20 °C and then quenched with 1.5 M 
NH4HCO3. 

For purification of the trityl-on oligomers, the quenched NH4HCO3 solution is 
loaded onto a C-18 containing cartridge that had been prewashed with acetonitrile 
followed by 50 mM TEAA. After washing the loaded cartridge with water, the RNA 
is detritylated with 0.5% TFA for 13 min. The cartridge is then washed again with 
water, salt exchanged with 1 M NaCl and washed with water again. The 
oligonucleotide is then eluted with 30% acetonitrile. 

Inactive hammerhead ribozymes or binding attenuated control (BAC) 
oligonucleotides are synthesized by substituting a U for G5 and a U for A14 
(numbering from Hertel, K. J., et al., 1992, Nucleic Acids Res., 20, 3252). Similarly, 
one or more nucleotide substitutions can be introduced in other enzymatic nucleic 
acid molecules to inactivate the molecule and such molecules can serve as a negative 
control. 

The average stepwise coupling yields are typically >98% (Wincott et al., 1995 
Nucleic Acids Res. 23, 2677-2684). Those of ordinary skill in the art will recognize 
that the scale of synthesis can be adapted to be larger or smaller than the examples 
described above, including but not limited to 96-well format; all that is important is 
the ratio of chemicals used in the reaction. 
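
To show why stepwise coupling yield matters, and why long molecules are difficult to make by automated synthesis, the theoretical fraction of full-length product can be estimated as the stepwise yield raised to the number of coupling steps. The Python sketch below makes this estimate under two stated assumptions that are not taken from the cited references: every coupling proceeds with the same yield, and the first nucleoside is already attached to the support, so a molecule of length n requires n - 1 couplings. The function name is hypothetical.

```python
def full_length_yield(stepwise_yield, length):
    """Theoretical fraction of full-length product after solid-phase synthesis.

    Assumes a uniform stepwise coupling yield and that the first nucleoside is
    pre-loaded on the support, so (length - 1) couplings are performed.
    """
    if not 0.0 < stepwise_yield <= 1.0:
        raise ValueError("stepwise_yield must be a fraction in (0, 1]")
    if length < 1:
        raise ValueError("length must be at least 1")
    return stepwise_yield ** (length - 1)


if __name__ == "__main__":
    for n in (20, 36, 50, 80, 100):
        fraction = full_length_yield(0.98, n)
        print(f"{n}-mer at 98% stepwise yield: {fraction:.1%} full-length")
```

The estimate ignores losses during deprotection and purification, so isolated yields would be expected to be lower than these theoretical figures.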

Alternatively, the nucleic acid molecules of the present invention can be 
synthesized separately and joined together post-synthetically, for example by ligation 
(Moore et al., 1992, Science 256, 9923; Draper et al., International PCT publication 
No. WO 93/23569; Shabarova et al., 1991, Nucleic Acids Research 19, 4247; Bellon 
et al., 1997, Nucleosides & Nucleotides, 16, 951; Bellon et al., 1997, Bioconjugate 
Chem. 8, 204). 

The nucleic acid molecules of the present invention are modified extensively 
to enhance stability by modification with nuclease resistant groups, for example, 2'- 
amino, 2'-C-allyl, 2'-fluoro, 2'-O-methyl, 2'-H (for a review see Usman and 
Cedergren, 1992, TIBS 17, 34; Usman et al., 1994, Nucleic Acids Symp. Ser. 31, 
163). Ribozymes are purified by gel electrophoresis using general methods or are 
purified by high pressure liquid chromatography (HPLC; see Wincott et al., supra, 
the totality of which is hereby incorporated herein by reference) and are 
resuspended in water. 

The sequences of the ribozymes and antisense constructs that are chemically 
synthesized, useful in this study, are shown in Tables III to IX. Those in the art 
will recognize that these sequences are representative only of many more such 
sequences where the enzymatic portion of the ribozyme (all but the binding arms) is 
altered to affect activity. The ribozyme and antisense construct sequences listed in 
Tables III to IX may be formed of ribonucleotides or other nucleotides or non- 
nucleotides. Such ribozymes with enzymatic activity are equivalent to the ribozymes 
described specifically in the Tables. 

Optimizing Activity of the nucleic acid molecule of the invention. 

Chemically synthesizing nucleic acid molecules with modifications (base, 
sugar and/or phosphate) that prevent their degradation by serum ribonucleases may 
increase their potency (see e.g., Eckstein et al, International Publication No. 
WO 92/07065; Perrault et al, 1990 Nature 344, 565; Pieken et al, 1991, Science 
253, 314; Usman and Cedergren, 1992, Trends in Biochem. Sci. 17, 334; Usman et 
al, International Publication No. WO 93/15187; Rossi et al, International 
Publication No. WO 91/03162; Sproat, US Patent No. 5,334,711; and Burgin et al, 
supra; all of these describe various chemical modifications that can be made to the 
base, phosphate and/or sugar moieties of the nucleic acid molecules described 
herein). All these references are incorporated by reference herein. Modifications 
which enhance their efficacy in cells, and removal of bases from nucleic acid 
molecules to shorten oligonucleotide synthesis times and reduce chemical 
requirements are desired. 

There are several examples in the art describing sugar, base and phosphate 
modifications that can be introduced into nucleic acid molecules with significant 
enhancement in their nuclease stability and efficacy. For example, oligonucleotides 
are modified to enhance stability and/or enhance biological activity by modification 
with nuclease resistant groups, for example, 2'-amino, 2'-C-allyl, 2'-fluoro, 2'-O- 
methyl, 2'-H, nucleotide base modifications (for a review see Usman and Cedergren, 
1992, TIBS 17, 34; Usman et al., 1994, Nucleic Acids Symp. Ser. 31, 163; Burgin et 
al., 1996, Biochemistry, 35, 14090). Sugar modifications of nucleic acid molecules 
have been extensively described in the art (see Eckstein et al., International 
Publication PCT No. WO 92/07065; Perrault et al., Nature, 1990, 344, 565-568; 
Pieken et al., Science, 1991, 253, 314-317; Usman and Cedergren, Trends in 
Biochem. Sci., 1992, 17, 334-339; Usman et al., International Publication PCT No. 
WO 93/15187; Sproat, US Patent No. 5,334,711 and Beigelman et al., 1995, J. 
Biol. Chem., 270, 25702; Beigelman et al., International PCT publication No. WO 
97/26270; Beigelman et al., US Patent No. 5,716,824; Usman et al., US patent No. 
5,627,053; Woolf et al., International PCT Publication No. WO 98/13526; 
Thompson et al., USSN 60/082,404 which was filed on April 20, 1998; Karpeisky et 
al., 1998, Tetrahedron Lett., 39, 1131; Earnshaw and Gait, 1998, Biopolymers 
(Nucleic acid Sciences), 48, 39-55; Verma and Eckstein, 1998, Annu. Rev. Biochem., 
67, 99-134; and Burlina et al., 1997, Bioorg. Med. Chem., 5, 1999-2010; all of the 
references are hereby incorporated by reference herein in their totalities). Such 
publications describe general methods and strategies to determine the location of 
incorporation of sugar, base and/or phosphate modifications and the like into 
ribozymes without inhibiting catalysis. In view of such teachings, similar 
modifications can be used as described herein to modify the nucleic acid molecules 
of the instant invention. 

While chemical modification of oligonucleotide internucleotide linkages with 
phosphorothioate, phosphorodithioate, and/or 5'-methylphosphonate linkages 
improves stability, too many of these modifications may cause some toxicity. 
Therefore when designing nucleic acid molecules the amount of these 
internucleotide linkages should be minimized. The reduction in the concentration of 
these linkages should lower toxicity resulting in increased efficacy and higher 
specificity of these molecules. 

Nucleic acid molecules having chemical modifications which maintain or 
enhance activity are provided. Such nucleic acid is also generally more resistant to 
nucleases than unmodified nucleic acid. Thus, in a cell and/or in vivo the activity 
may not be significantly lowered. Therapeutic nucleic acid molecules delivered 
exogenously must optimally be stable within cells until translation of the target RNA 
has been inhibited long enough to reduce the levels of the undesirable protein. This 
period of time varies between hours to days depending upon the disease state. 
Clearly, nucleic acid molecules must be resistant to nucleases in order to function as 
effective intracellular therapeutic agents. Improvements in the chemical synthesis of 
RNA and DNA (Wincott et al., 1995, Nucleic Acids Res. 23, 2677; Caruthers et al., 
1992, Methods in Enzymology 211, 3-19; incorporated by reference herein) have 
expanded the ability to modify nucleic acid molecules by introducing nucleotide 
modifications to enhance their nuclease stability as described above. 

Use of the nucleic acid-based molecules of the invention will lead to 
better treatment of the disease progression by affording the possibility of 
combination therapies (e.g., multiple antisense or enzymatic nucleic acid molecules 
targeted to different genes, nucleic acid molecules coupled with known small 
molecule inhibitors, or intermittent treatment with combinations of molecules 
(including different motifs) and/or other chemical or biological molecules). The 
treatment of patients with nucleic acid molecules may also include combinations of 
different types of nucleic acid molecules. 

Therapeutic nucleic acid molecules (e.g., enzymatic nucleic acid molecules 
and antisense nucleic acid molecules) delivered exogenously must optimally be 
stable within cells until translation of the target RNA has been inhibited long enough 
to reduce the levels of the undesirable protein. This period of time varies between 
hours to days depending upon the disease state. Clearly, these nucleic acid 
molecules must be resistant to nucleases in order to function as effective intracellular 
therapeutic agents. Improvements in the chemical synthesis of nucleic acid 
molecules described in the instant invention and in the art have expanded the ability 
to modify nucleic acid molecules by introducing nucleotide modifications to enhance 
their nuclease stability as described above. 

By "enhanced enzymatic activity" is meant to include activity measured in 
cells and/or in vivo where the activity is a reflection of both catalytic activity and 
ribozyme stability. In this invention, the product of these properties is increased or 
not significantly (less than 10-fold) decreased in vivo compared to an all RNA 
ribozyme or all DNA enzyme. 

In yet another preferred embodiment, nucleic acid catalysts having chemical 
modifications which maintain or enhance enzymatic activity are provided. Such 
nucleic acid is also generally more resistant to nucleases than unmodified nucleic 
acid. Thus, in a cell and/or in vivo the activity may not be significantly lowered. As 
exemplified herein, such ribozymes are useful in a cell and/or in vivo even if activity 
overall is reduced 10-fold (Burgin et al., 1996, Biochemistry, 35, 14090). Such 
ribozymes herein are said to "maintain" the enzymatic activity of an all RNA 
ribozyme. 
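
As a worked illustration of the criterion above, the following Python sketch treats activity in cells or in vivo as the product of catalytic activity and stability, and checks whether a modified ribozyme "maintains" activity relative to an all-RNA reference (activity increased, or decreased less than 10-fold). The function names, the relative units, and the example values are hypothetical and are not taken from the specification.

```python
def effective_activity(catalytic_activity: float, stability: float) -> float:
    """Product of catalytic activity and stability (arbitrary relative units)."""
    return catalytic_activity * stability


def maintains_activity(modified: float, all_rna_reference: float,
                       max_fold_decrease: float = 10.0) -> bool:
    """True if activity is increased, or decreased by less than max_fold_decrease."""
    if modified <= 0 or all_rna_reference <= 0:
        raise ValueError("activities must be positive")
    return all_rna_reference / modified < max_fold_decrease


# Hypothetical values: the modification lowers catalysis 5-fold but
# raises stability 50-fold relative to the all-RNA ribozyme.
reference = effective_activity(1.0, 1.0)
modified = effective_activity(0.2, 50.0)
print(maintains_activity(modified, reference))
```

In this sketch a loss in catalytic rate can be offset by a gain in nuclease resistance, which is the trade-off the definition is meant to capture.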

In another aspect the nucleic acid molecules comprise a 5'- and/or a 3'-cap 
structure. 

By "cap structure" is meant chemical modifications, which have been 
incorporated at either terminus of the oligonucleotide (see, for example, Wincott et 
al., WO 97/26270, incorporated by reference herein). These terminal modifications 
protect the nucleic acid molecule from exonuclease degradation, and may help in 
delivery and/or localization within a cell. The cap may be present at the 5'-terminus 
(5'-cap) or at the 3'-terminus (3'-cap) or may be present on both termini. In 
non-limiting examples the 5'-cap is selected from the group comprising inverted abasic 
residue (moiety), 4',5'-methylene nucleotide; 1-(beta-D-erythrofuranosyl) nucleotide, 
4'-thio nucleotide, carbocyclic nucleotide; 1,5-anhydrohexitol nucleotide; 
L-nucleotides; alpha-nucleotides; modified base nucleotide; phosphorodithioate 
linkage; threo-pentofuranosyl nucleotide; acyclic 3',4'-seco nucleotide; acyclic 
3,4-dihydroxybutyl nucleotide; acyclic 3,5-dihydroxypentyl nucleotide, 3'-3'-inverted 
nucleotide moiety; 3'-3'-inverted abasic moiety; 3'-2'-inverted nucleotide moiety; 
3'-2'-inverted abasic moiety; 1,4-butanediol phosphate; 3'-phosphoramidate; 
hexylphosphate; aminohexyl phosphate; 3'-phosphate; 3'-phosphorothioate; 
phosphorodithioate; or bridging or non-bridging methylphosphonate moiety (for 
more details see Wincott et al., International PCT publication No. WO 97/26270, 
incorporated by reference herein). 

In yet another preferred embodiment, the 3'-cap is selected from a group 
comprising 4',5'-methylene nucleotide; 1-(beta-D-erythrofuranosyl) nucleotide; 
4'-thio nucleotide, carbocyclic nucleotide; 5'-amino-alkyl phosphate; 
1,3-diamino-2-propyl phosphate, 3-aminopropyl phosphate; 6-aminohexyl phosphate; 
1,2-aminododecyl phosphate; hydroxypropyl phosphate; 1,5-anhydrohexitol nucleotide; 
L-nucleotide; alpha-nucleotide; modified base nucleotide; phosphorodithioate; 
threo-pentofuranosyl nucleotide; acyclic 3',4'-seco nucleotide; 3,4-dihydroxybutyl 
nucleotide; 3,5-dihydroxypentyl nucleotide, 5'-5'-inverted nucleotide moiety; 
5'-5'-inverted abasic moiety; 5'-phosphoramidate; 5'-phosphorothioate; 1,4-butanediol 
phosphate; 5'-amino; bridging and/or non-bridging 5'-phosphoramidate, 
phosphorothioate and/or phosphorodithioate, bridging or non-bridging 
methylphosphonate and 5'-mercapto moieties (for more details, see Beaucage and 
Iyer, 1993, Tetrahedron 49, 1925; incorporated by reference herein). 

By the term "non-nucleotide" is meant any group or compound which can be 
incorporated into a nucleic acid chain in the place of one or more nucleotide units, 
including either sugar and/or phosphate substitutions, and allows the remaining 
bases to exhibit their enzymatic activity. The group or compound is abasic in that it 
does not contain a commonly recognized nucleotide base, such as adenosine, 
guanine, cytosine, uracil or thymine. 

An "alkyl" group refers to a saturated aliphatic hydrocarbon, including 
straight-chain, branched-chain, and cyclic alkyl groups. Preferably, the alkyl group 
has 1 to 12 carbons. More preferably it is a lower alkyl of from 1 to 7 carbons, more 
preferably 1 to 4 carbons. The alkyl group may be substituted or unsubstituted. 
When substituted the substituted group(s) is preferably, hydroxyl, cyano, alkoxy, 
=O, =S, NO2 or N(CH3)2, amino, or SH. The term also includes alkenyl groups 
which are unsaturated hydrocarbon groups containing at least one carbon-carbon 
double bond, including straight-chain, branched-chain, and cyclic groups. 
Preferably, the alkenyl group has 1 to 12 carbons. More preferably it is a lower 
alkenyl of from 1 to 7 carbons, more preferably 1 to 4 carbons. The alkenyl group 
may be substituted or unsubstituted. When substituted the substituted group(s) is 
preferably, hydroxyl, cyano, alkoxy, =O, =S, NO2, halogen, N(CH3)2, amino, or 
SH. The term "alkyl" also includes alkynyl groups which have an unsaturated 
hydrocarbon group containing at least one carbon-carbon triple bond, including 
straight-chain, branched-chain, and cyclic groups. Preferably, the alkynyl group has 
1 to 12 carbons. More preferably it is a lower alkynyl of from 1 to 7 carbons, more 
preferably 1 to 4 carbons. The alkynyl group may be substituted or unsubstituted. 
When substituted the substituted group(s) is preferably, hydroxyl, cyano, alkoxy, 
=O, =S, NO2 or N(CH3)2, amino or SH. 

Such alkyl groups may also include aryl, alkylaryl, carbocyclic aryl, 
heterocyclic aryl, amide and ester groups. An "aryl" group refers to an aromatic 
group which has at least one ring having a conjugated pi electron system and includes 
carbocyclic aryl, heterocyclic aryl and biaryl groups, all of which may be optionally 
substituted. The preferred substituent(s) of aryl groups are halogen, trihalomethyl, 
hydroxyl, SH, OH, cyano, alkoxy, alkyl, alkenyl, alkynyl, and amino groups. An 
"alkylaryl" group refers to an alkyl group (as described above) covalently joined to 
an aryl group (as described above). Carbocyclic aryl groups are groups wherein the 
ring atoms on the aromatic ring are all carbon atoms. The carbon atoms are 
optionally substituted. Heterocyclic aryl groups are groups having from 1 to 3 
heteroatoms as ring atoms in the aromatic ring and the remainder of the ring atoms 
are carbon atoms. Suitable heteroatoms include oxygen, sulfur, and nitrogen, and 
such groups include furanyl, thienyl, pyridyl, pyrrolyl, N-lower alkyl pyrrolo, pyrimidyl, 
pyrazinyl, imidazolyl and the like, all optionally substituted. An "amide" refers to an 
-C(O)-NH-R, where R is either alkyl, aryl, alkylaryl or hydrogen. An "ester" refers 
to an -C(O)-OR', where R is either alkyl, aryl, alkylaryl or hydrogen. 

By "nucleotide" as used herein is as recognized in the art to include natural 
bases (standard), and modified bases well known in the art. Such bases are generally 
located at the 1' position of a nucleotide sugar moiety. Nucleotides generally 
comprise a base, sugar and a phosphate group. The nucleotides can be unmodified or 
modified at the sugar, phosphate and/or base moiety (also referred to 
interchangeably as nucleotide analogs, modified nucleotides, non-natural 
nucleotides, non-standard nucleotides and others; see for example, Usman and 
McSwiggen, supra; Eckstein et al., International PCT Publication No. WO 
92/07065; Usman et al., International PCT Publication No. WO 93/15187; Uhlmann 
& Peyman, 1990, Chemical Reviews, 90, 4, 544-579, all are hereby incorporated by 
reference herein). There are several examples of modified nucleic acid bases known 
in the art as summarized by Limbach et al., 1994, Nucleic Acids Res. 22, 2183. 
Some of the non-limiting examples of base modifications that can be introduced into 
nucleic acid molecules include, inosine, purine, pyridin-4-one, pyridin-2-one, 
phenyl, pseudouracil, 2, 4, 6-trimethoxy benzene, 3-methyl uracil, dihydrouridine, 
naphthyl, aminophenyl, 5-alkylcytidines (e.g., 5-methylcytidine), 5-alkyluridines 
(e.g., ribothymidine), 5-halouridine (e.g., 5-bromouridine) or 6-azapyrimidines or 
6-alkylpyrimidines (e.g. 6-methyluridine), propyne, and others (Burgin et al., 1996, 
Biochemistry, 35, 14090; Uhlmann & Peyman, supra). By "modified bases" in this 
aspect is meant nucleotide bases other than adenine, guanine, cytosine and uracil at 
1' position or their equivalents; such bases may be used at any position, for example, 
within the catalytic core of an enzymatic nucleic acid molecule and/or in the 
substrate-binding regions of the nucleic acid molecule. 

In a preferred embodiment, the invention features modified ribozymes with 
phosphate backbone modifications comprising one or more phosphorothioate, 
phosphorodithioate, methylphosphonate, morpholino, amidate carbamate, 
carboxymethyl, acetamidate, polyamide, sulfonate, sulfonamide, sulfamate, 
formacetal, thioformacetal, and/or alkylsilyl substitutions. For a review of 
oligonucleotide backbone modifications see Hunziker and Leumann, 1995, Nucleic 
Acid Analogues: Synthesis and Properties, in Modern Synthetic Methods, VCH, 
331-417, and Mesmaeker et al., 1994, Novel Backbone Replacements for 
Oligonucleotides, in Carbohydrate Modifications in Antisense Research, ACS, 
24-39. These references are hereby incorporated by reference herein. 

By "abasic" is meant sugar moieties lacking a base or having other chemical 
groups in place of a base at the 1' position (for more details, see Wincott et al., 
International PCT publication No. WO 97/26270). 

By "unmodified nucleoside" is meant one of the bases adenine, cytosine, 
guanine, thymine, uracil joined to the 1' carbon of β-D-ribo-furanose. 

By "modified nucleoside" is meant any nucleotide base which contains a 
modification in the chemical structure of an unmodified nucleotide base, sugar 
and/or phosphate. 

In connection with 2'-modified nucleotides as described for the present 
invention, by "amino" is meant 2'-NH2 or 2'-O-NH2, which may be modified or 
unmodified. Such modified groups are described, for example, in Eckstein et al., 
U.S. Patent 5,672,695 and Matulic-Adamic et al., WO 98/28317, respectively, which 
are both incorporated by reference herein in their entireties. 

Various modifications to nucleic acid (e.g., antisense and ribozyme) structure 
can be made to enhance the utility of these molecules. Such modifications will 
enhance shelf-life, half-life in vitro, stability, and ease of introduction of such 
oligonucleotides to the target site, e.g., to enhance penetration of cellular 
membranes, and confer the ability to recognize and bind to targeted cells. 

Use of these molecules will lead to better treatment of the disease progression 
by affording the possibility of combination therapies (e.g., multiple ribozymes 
targeted to different genes, ribozymes coupled with known small molecule 
inhibitors, or intermittent treatment with combinations of ribozymes (including 
different ribozyme motifs) and/or other chemical or biological molecules). The 
treatment of patients with nucleic acid molecules may also include combinations of 
different types of nucleic acid molecules. Therapies may be devised which include a 
mixture of ribozymes (including different ribozyme motifs), antisense and/or 2-5A 
chimera molecules to one or more targets to alleviate symptoms of a disease. 

Administration of Nucleic Acid Molecules 

Methods for the delivery of nucleic acid molecules are described in Akhtar et 
al, 1992, Trends Cell Bio., 2, 139; and Delivery Strategies for Antisense 
Oligonucleotide Therapeutics, ed. Akhtar, 1995 which are both incorporated herein 
by reference. Sullivan et al, PCT WO 94/02595, further describes the general 
methods for delivery of enzymatic RNA molecules. These protocols may be utilized 
for the delivery of virtually any nucleic acid molecule. Nucleic acid molecules may 
be administered to cells by a variety of methods known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, nucleic acid 
molecules may be directly delivered ex vivo to cells or tissues with or without the 
aforementioned vehicles. Alternatively, the nucleic acid/vehicle combination is 
locally delivered by direct injection or by use of a catheter, infusion pump or stent. 
Other routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed 
descriptions of nucleic acid delivery and administration are provided in Sullivan et 
al., supra, Draper et al., PCT WO 93/23569, Beigelman et al., PCT WO 99/05094, 
and Klimuk et al., PCT WO 99/04819, all of which have been incorporated by 
reference herein. 

In addition, the nucleic acid molecules of the instant invention, used to treat 
pulmonary diseases and disorders, may be administered directly to the lungs via 
pulmonary delivery. The pulmonary delivery of oligonucleotides is described by 
Bennett et al., International PCT publication Nos. WO 99/60166 and WO 99/60010; 
Danahay et al., 1999, Pharm. Res., 16(10), 1542-1549; Metzger and Nyce, 1999, J. 
Allergy Clin. Immunol., 104(2, Pt 1), 260-266; Nicklin et al., 1998, Pharm. Res., 
15(4), 583-591; Illum and Watts, International PCT publication No. WO 97/35562; 
and Nyce, 1997, Expert Opin. Invest. Drugs, 6(9), 1149-1156. 

The molecules of the instant invention can be used as pharmaceutical agents. 
Pharmaceutical agents prevent, inhibit the occurrence, or treat (alleviate a symptom 
to some extent, preferably all of the symptoms) of a disease state in a patient. 

The negatively charged polynucleotides of the invention can be administered 
(e.g., RNA, DNA or protein) and introduced into a patient by any standard means, 
with or without stabilizers, buffers, and the like, to form a pharmaceutical 
composition. When it is desired to use a liposome delivery mechanism, standard 
protocols for formation of liposomes can be followed. The compositions of the 
present invention may also be formulated and used as tablets, capsules or elixirs for 
oral administration; suppositories for rectal administration; sterile solutions; 
suspensions for injectable administration; and other compositions known in the art. 

The present invention also includes pharmaceutically acceptable formulations 
of the compounds described. These formulations include salts of the above 
compounds, e.g., acid addition salts, including salts of hydrochloric, hydrobromic, 
acetic acid, and benzene sulfonic acid. 

A pharmacological composition or formulation refers to a composition or 
formulation in a form suitable for administration, e.g., systemic administration, into 
a cell or patient, preferably a human. Suitable forms, in part, depend upon the use or 
the route of entry, for example oral, transdermal, or by injection. Such forms should 
not prevent the composition or formulation from reaching a target cell (i.e., a cell to 
which the negatively charged polymer is desired to be delivered). For example, 
pharmacological compositions injected into the blood stream should be soluble. 
Other factors are known in the art, and include considerations such as toxicity and 
forms which prevent the composition or formulation from exerting its effect. 

By "systemic administration" is meant in vivo systemic absorption or accumulation of 
drugs in the blood stream followed by distribution throughout the entire body. 
Administration routes that lead to systemic absorption include, without limitations: 
intravenous, subcutaneous, intraperitoneal, inhalation, oral, intrapulmonary and 
intramuscular. Each of these administration routes exposes the desired negatively 
charged polymers, e.g., nucleic acids, to an accessible diseased tissue. The rate of 
entry of a drug into the circulation has been shown to be a function of molecular 
weight or size. The use of a liposome or other drug carrier comprising the 
compounds of the instant invention can potentially localize the drug, for example, in 
certain tissue types, such as the tissues of the reticular endothelial system (RES). A 
liposome formulation that can facilitate the association of drug with the surface of 
cells, such as, lymphocytes and macrophages is also useful. This approach may 
provide enhanced delivery of the drug to target cells by taking advantage of the 
specificity of macrophage and lymphocyte immune recognition of abnormal cells, 
such as cancer cells. 

By "pharmaceutically acceptable formulation" is meant a composition or 
formulation that allows for the effective distribution of the nucleic acid molecules of 
the instant invention in the physical location most suitable for their desired activity. 
Non-limiting examples of agents suitable for formulation with the nucleic acid 
molecules of the instant invention include: P-glycoprotein inhibitors (such as 
Pluronic P85) which can enhance entry of drugs into the CNS (Jolliet-Riant and 
Tillement, 1999, Fundam. Clin. Pharmacol, 13, 16-26); biodegradable polymers, 
such as poly (DL-lactide-coglycolide) microspheres for sustained release delivery 
after intracerebral implantation (Emerich, DF et al, 1999, Cell Transplant, 8, 47-58) 
Alkermes, Inc. Cambridge, MA; and loaded nanoparticles, such as those made of 
polybutylcyanoacrylate, which can deliver drugs across the blood brain barrier and 
can alter neuronal uptake mechanisms (Prog Neuropsychopharmacol Biol 
Psychiatry, 23, 941-949, 1999). Other non-limiting examples of delivery strategies 
for the nucleic acid molecules of the instant invention include material described in 
Boado et al., 1998, J. Pharm. Sci., 87, 1308-1315; Tyler et al., 1999, FEBS Lett., 
421, 280-284; Pardridge et al., 1995, PNAS USA., 92, 5592-5596; Boado, 1995, Adv. 
Drug Delivery Rev., 15, 73-107; Aldrian-Herrada et al., 1998, Nucleic Acids Res., 
26, 4910-4916; and Tyler et al., 1999, PNAS USA., 96, 7053-7058. 

The invention also features the use of the composition comprising surface- 
modified liposomes containing poly (ethylene glycol) lipids (PEG-modified, or long- 
circulating liposomes or stealth liposomes). These formulations offer a method for 
increasing the accumulation of drugs in target tissues. This class of drug carriers 
resists opsonization and elimination by the mononuclear phagocytic system (MPS or 
RES), thereby enabling longer blood circulation times and enhanced tissue exposure 
for the encapsulated drug (Lasic et al., Chem. Rev. 1995, 95, 2601-2627; Ishiwata et 
al., Chem. Pharm. Bull. 1995, 43, 1005-1011). All incorporated by reference herein. 
Such liposomes have been shown to accumulate selectively in tumors, presumably 
by extravasation and capture in the neovascularized target tissues (Lasic et al., 
Science 1995, 267, 1275-1276; Oku et al., 1995, Biochim. Biophys. Acta, 1238, 86-90). 
All incorporated by reference herein. The long-circulating liposomes enhance 
the pharmacokinetics and pharmacodynamics of DNA and RNA, particularly 
compared to conventional cationic liposomes which are known to accumulate in 
tissues of the MPS (Liu et al., J. Biol. Chem. 1995, 42, 24864-24870; Choi et al., 
International PCT Publication No. WO 96/10391; Ansell et al., International PCT 
Publication No. WO 96/10390; Holland et al, International PCT Publication No. 
WO 96/10392; all of which are incorporated by reference herein). Long-circulating 
liposomes are also likely to protect drugs from nuclease degradation to a greater 
extent compared to cationic liposomes, based on their ability to avoid accumulation 
in metabolically aggressive MPS tissues such as the liver and spleen. 

In addition, the invention features the use of methods to deliver the nucleic 
acid molecules of the instant invention to hematopoietic cells, including monocytes 
and lymphocytes. These methods are described in detail by Hartmann et al., 1998, J. 
Pharmacol. Exp. Ther., 285(2), 920-928; Kronenwett et al., 1998, Blood, 91(3), 852-862; 
Filion and Phillips, 1997, Biochim. Biophys. Acta., 1329(2), 345-356; Ma and 
Wei, 1996, Leuk. Res., 20(11/12), 925-930; and Bongartz et al., 1994, Nucleic Acids 
Research, 22(22), 4681-8. Such methods, as described above, include the use of free 
oligonucleotide, cationic lipid formulations, liposome formulations including pH 
sensitive liposomes and immunoliposomes, and bioconjugates including 
oligonucleotides conjugated to fusogenic peptides, for the transfection of 
hematopoietic cells with oligonucleotides. 

The present invention also includes compositions prepared for storage or 
administration which include a pharmaceutically effective amount of the desired 
compounds in a pharmaceutically acceptable carrier or diluent. Acceptable carriers 
or diluents for therapeutic use are well known in the pharmaceutical art, and are 
described, for example, in Remington's Pharmaceutical Sciences, Mack Publishing 
Co. (A.R. Gennaro edit. 1985) hereby incorporated by reference herein. For 
example, preservatives, stabilizers, dyes and flavoring agents may be provided. 
These include sodium benzoate, sorbic acid and esters of p-hydroxybenzoic acid. In 
addition, antioxidants and suspending agents may be used. 

A pharmaceutically effective dose is that dose required to prevent, inhibit the 
occurrence, or treat (alleviate a symptom to some extent, preferably all of the 
symptoms) of a disease state. The pharmaceutically effective dose depends on the 
type of disease, the composition used, the route of administration, the type of 
mammal being treated, the physical characteristics of the specific mammal under 
consideration, concurrent medication, and other factors which those skilled in the 
medical arts will recognize. Generally, an amount between 0.1 mg/kg and 100 
mg/kg body weight/day of active ingredients is administered dependent upon 
potency of the negatively charged polymer. 
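
The stated range can be turned into a simple arithmetic illustration. The Python sketch below scales the 0.1 mg/kg and 100 mg/kg bounds from the text by body weight to give a daily range in milligrams. The function name and the 70 kg body weight are hypothetical, and the sketch does not model potency or any of the other factors listed above.

```python
from typing import Tuple

MIN_DOSE_MG_PER_KG = 0.1    # lower bound stated in the text (mg/kg body weight/day)
MAX_DOSE_MG_PER_KG = 100.0  # upper bound stated in the text (mg/kg body weight/day)


def daily_dose_range_mg(body_weight_kg: float) -> Tuple[float, float]:
    """Return the (low, high) daily amount of active ingredient in mg."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (MIN_DOSE_MG_PER_KG * body_weight_kg,
            MAX_DOSE_MG_PER_KG * body_weight_kg)


# Hypothetical example: a 70 kg subject.
low, high = daily_dose_range_mg(70.0)
print(f"{low:.1f} mg/day to {high:.1f} mg/day")
```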

The nucleic acid molecules of the present invention may also be administered 
to a patient in combination with other therapeutic compounds to increase the overall 
therapeutic effect. The use of multiple compounds to treat an indication may 
increase the beneficial effects while reducing the presence of side effects. Oxygen 
therapy, bronchodilators, corticosteroids, antibacterials, vaccinations, acetylcysteine, 
mucokinetic agents, and DNase (Pulmozyme) are non-limiting examples of 
compounds and/or methods that can be combined with or used in conjunction with 
the nucleic acid molecules (e.g. ribozymes and antisense molecules) of the instant 
invention. Those skilled in the art will recognize that other drug compounds and 
therapies can be similarly and readily combined with the nucleic acid molecules of 
the instant invention (e.g. ribozymes and antisense molecules) and are, therefore, 
within the scope of the instant invention. 

Alternatively, certain of the nucleic acid molecules of the instant invention can 
be expressed within cells from eukaryotic promoters (e.g., Izant and Weintraub, 
1985, Science, 229, 345; McGarry and Lindquist, 1986, Proc. Natl. Acad. Sci. USA, 
83, 399; Scanlon et al., 1991, Proc. Natl. Acad. Sci. USA, 88, 10591-5; Kashani-Sabet 
et al., 1992, Antisense Res. Dev., 2, 3-15; Dropulic et al., 1992, J. Virol., 66, 
1432-41; Weerasinghe et al., 1991, J. Virol., 65, 5531-4; Ojwang et al., 1992, 
Proc. Natl. Acad. Sci. USA, 89, 10802-6; Chen et al., 1992, Nucleic Acids Res., 20, 
4581-9; Sarver et al., 1990, Science, 247, 1222-1225; Thompson et al., 1995, 
Nucleic Acids Res., 23, 2259; Good et al., 1997, Gene Therapy, 4, 45; all of the 
references are hereby incorporated in their totality by reference herein). Those 
skilled in the art realize that any nucleic acid can be expressed in eukaryotic cells 
from the appropriate DNA/RNA vector. The activity of such nucleic acids can be 
augmented by their release from the primary transcript by a ribozyme (Draper et al., 
PCT WO 93/23569, and Sullivan et al., PCT WO 94/02595; Ohkawa et al., 1992, 
Nucleic Acids Symp. Ser., 27, 15-6; Taira et al., 1991, Nucleic Acids Res., 19, 5125-30; 
Ventura et al., 1993, Nucleic Acids Res., 21, 3249-55; Chowrira et al., 1994, J. 
Biol. Chem., 269, 25856; all of these references are hereby incorporated in their 
totalities by reference herein). 

In another aspect of the invention, RNA molecules of the present invention are 
preferably expressed from transcription units (see, for example, Couture et al., 1996, 
TIG., 12, 510) inserted into DNA or RNA vectors. The recombinant vectors are 
preferably DNA plasmids or viral vectors. Ribozyme expressing viral vectors could 
be constructed based on, but not limited to, adeno-associated virus, retrovirus, 
adenovirus, or alphavirus. Preferably, the recombinant vectors capable of expressing 
the nucleic acid molecules are delivered as described above, and persist in target 
cells. Alternatively, viral vectors may be used that provide for transient expression 
of nucleic acid molecules. Such vectors might be repeatedly administered as 
necessary. Once expressed, the nucleic acid molecule binds to the target mRNA. 
Delivery of nucleic acid molecule expressing vectors could be systemic, such as by 
intravenous or intra-muscular administration, by administration to target cells 
explanted from the patient followed by reintroduction into the patient, or by any other 
means that would allow for introduction into the desired target cell (for a review, 
see Couture et al., 1996, TIG., 12, 510). 

In one aspect, the invention features an expression vector comprising a nucleic 
acid sequence encoding at least one of the nucleic acid molecules disclosed in the 
instant invention. The nucleic acid sequence encoding the nucleic acid molecule of 
the instant invention is operably linked in a manner which allows expression of that 
nucleic acid molecule. 

In another aspect, the invention features an expression vector comprising: a) a 
transcription initiation region (e.g., eukaryotic pol I, II or III initiation region); b) a 
transcription termination region (e.g., eukaryotic pol I, II or III termination region); 
c) a nucleic acid sequence encoding at least one of the nucleic acid catalysts of the 
instant invention; and wherein said sequence is operably linked to said initiation 
region and said termination region, in a manner which allows expression and/or 
delivery of said nucleic acid molecule. The vector may optionally include an open 
reading frame (ORF) for a protein operably linked on the 5'-side or the 3'-side of the 
sequence encoding the nucleic acid catalyst of the invention; and/or an intron 
(intervening sequences). 
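
The arrangement of vector elements described above can be pictured as a small data structure. The Python sketch below is only an illustration: the class name, field names, and example labels are hypothetical, and placing the intron upstream of the encoded sequence is an assumption, since the text does not fix its position.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ExpressionVector:
    """Hypothetical container for the vector elements a) to c) listed above."""
    initiation_region: str            # a) e.g. a eukaryotic pol I, II or III initiation region
    termination_region: str           # b) e.g. a eukaryotic pol I, II or III termination region
    encoded_sequence: str             # c) sequence encoding the nucleic acid catalyst
    orf: Optional[str] = None         # optional open reading frame for a protein
    orf_side: Optional[str] = None    # "5'" or "3'" relative to the encoded sequence
    intron: Optional[str] = None      # optional intervening sequence

    def __post_init__(self) -> None:
        if self.orf is not None and self.orf_side not in ("5'", "3'"):
            raise ValueError("an ORF must be placed on the 5' or 3' side")

    def layout(self) -> List[str]:
        """Return the elements in 5'-to-3' order."""
        parts = [self.initiation_region]
        if self.orf is not None and self.orf_side == "5'":
            parts.append(self.orf)
        if self.intron is not None:
            # Assumption: the intron sits upstream of the encoded sequence.
            parts.append(self.intron)
        parts.append(self.encoded_sequence)
        if self.orf is not None and self.orf_side == "3'":
            parts.append(self.orf)
        parts.append(self.termination_region)
        return parts


vector = ExpressionVector("pol III initiation region", "pol III termination region",
                          "ribozyme-encoding sequence",
                          orf="reporter ORF", orf_side="5'")
print(" | ".join(vector.layout()))
```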

Transcription of the nucleic acid molecule sequences is driven from a 
promoter for eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or 
RNA polymerase III (pol III). Transcripts from pol II or pol III promoters will be 
expressed at high levels in all cells; the levels of a given pol II promoter in a given 
cell type will depend on the nature of the gene regulatory sequences (enhancers, 
silencers, etc.) present nearby. Prokaryotic RNA polymerase promoters are also 
used, providing that the prokaryotic RNA polymerase enzyme is expressed in the 
appropriate cells (Elroy-Stein and Moss, 1990, Proc. Natl. Acad. Sci. USA, 87, 
6743-7; Gao and Huang, 1993, Nucleic Acids Res., 21, 2867-72; Lieber et al., 
1993, Methods Enzymol., 217, 47-66; Zhou et al., 1990, Mol. Cell. Biol., 10, 4529-37). 
All of these references are incorporated by reference herein. 

Several investigators have demonstrated that nucleic acid molecules, such as 
ribozymes expressed from such promoters can function in mammalian cells (e.g. 
Kashani-Sabet et al, 1992, Antisense Res. Dev., 2, 3-15; Ojwang et al, 1992, 
Proc. Natl Acad. Sci. USA, 89, 10802-6; Chen et al, 1992, Nucleic Acids Res., 20, 
4581-9; Yu et al., 1993, Proc. Natl. Acad. Sci. USA, 90, 6340-4; L'Huillier et al., 
1992, EMBO J., 11, 4411-8; Lisziewicz et al., 1993, Proc. Natl. Acad. Sci. U.S.A., 
90, 8000-4; Thompson et al, 1995, Nucleic Acids Res., 23, 2259; and Sullenger & 
Cech, 1993, Science, 262, 1566). More specifically, transcription units such as the 
ones derived from genes encoding U6 small nuclear (snRNA), transfer RNA (tRNA) 
and adenovirus VA RNA are useful in generating high concentrations of desired 
RNA molecules such as ribozymes in cells (Thompson et al, supra; Couture and 
Stinchcomb, 1996, supra; Noonberg et al, 1994, Nucleic Acid Res., 22, 2830; 
Noonberg et al, US Patent No. 5,624,803; Good et al, 1997, Gene Ther., 4, 45; and 
Beigelman et al, International PCT Publication No. WO 96/18736; all of these 
publications are incorporated by reference herein. The above ribozyme transcription 
units can be incorporated into a variety of vectors for introduction into mammalian 
cells, including but not restricted to, plasmid DNA vectors, viral DNA vectors (such 
as adenovirus or adeno-associated virus vectors), or viral RNA vectors (such as 
retroviral or alphavirus vectors) (for a review, see Couture and Stinchcomb, 1996, 
supra). 

In yet another aspect, the invention features an expression vector comprising a 
nucleic acid sequence encoding at least one of the nucleic acid molecules of the 
invention, in a manner which allows expression of that nucleic acid molecule. The 
expression vector comprises, in one embodiment: a) a transcription initiation region; 
b) a transcription termination region; c) a nucleic acid sequence encoding at least 
one said nucleic acid molecule; and wherein said sequence is operably linked to said 
initiation region and said termination region, in a manner which allows expression 
and/or delivery of said nucleic acid molecule. 

In another preferred embodiment, the expression vector comprises: a) a 
transcription initiation region; b) a transcription termination region; c) an open 
reading frame; d) a nucleic acid sequence encoding at least one said nucleic acid 
molecule, wherein said sequence is operably linked to the 3'-end of said open 
reading frame; and wherein said sequence is operably linked to said initiation region, 
said open reading frame and said termination region, in a manner which allows 
expression and/or delivery of said nucleic acid molecule. 

In yet another embodiment the expression vector comprises: a) a transcription 
initiation region; b) a transcription termination region; c) an intron; d) a nucleic acid 
sequence encoding at least one said nucleic acid molecule; and wherein said 
sequence is operably linked to said initiation region, said intron and said termination 
region, in a manner which allows expression and/or delivery of said nucleic acid 
molecule. 



In another embodiment, the expression vector comprises: a) a transcription 
initiation region; b) a transcription termination region; c) an intron; d) an open 
reading frame; e) a nucleic acid sequence encoding at least one said nucleic acid 
molecule, wherein said sequence is operably linked to the 3'-end of said open 
reading frame; and wherein said sequence is operably linked to said initiation region, 
said intron, said open reading frame and said termination region, in a manner which 
allows expression and/or delivery of said nucleic acid molecule. 

Examples 

The following are non-limiting examples showing the selection, isolation, 
synthesis and activity of nucleic acids of the instant invention. 

The following examples demonstrate the selection and design of Antisense, 
hammerhead, DNAzyme, NCH, Amberzyme, Zinzyme, or G-Cleaver ribozyme 
molecules and binding/cleavage sites within CLCA1 RNA. 

Example 1: Reporter System 

Applicant used a target discovery and target validation approach to find 
genes that are involved in chronic mucous hypersecretion. In order to discover genes 
playing a role in the expression of mucins, a readily assayable reporter system was 
devised. The reporter system consists of a plasmid construct, termed pMUC5AC- 
EGFP, bearing a gene coding for Green Fluorescent Protein (GFP). The promoter 
region of the GFP gene is replaced by a portion of the Mucin 5AC promoter 
sufficient to direct efficient transcription of the GFP gene. The plasmid also 
contains the neomycin drug resistance gene. 

Example 2: Host Cell Line for Target Discovery 

The cell line selected as host for these studies, NCI-H292 (ATCC CRL-1848), 
is derived from a human lung mucoepidermoid carcinoma. The cells retain 
mucoepidermoid characteristics in culture and endogenously express mucin 5AC 
and mucin 2. The pMUC5AC-EGFP plasmid was transfected into NCI-H292 using 
a cationic lipid formulation. Following transfection, the cells were subjected to 
limiting dilution cloning under selection by 600 µg/mL Geneticin. Cells retaining 
the pMUC5AC-EGFP plasmid survive the Geneticin treatment and form colonies 
derived from single surviving cells. The resulting clonal cell lines were screened by 
flow cytometry for the capacity to upregulate GFP production directed by the Mucin 
5AC promoter. Treating the cells with sterilized M9 bacterial medium in which 
Pseudomonas aeruginosa had been cultured (Pseudomonas conditioned medium, 
PCM) induced the mucin promoter. The PCM is supplemented with phorbol 
myristate acetate (PMA). 

A clonal cell line highly responsive to mucin promoter induction, designated 
H292/MUC5AC/EGFP Clone8 (H292 Clone 8), was selected as the reporter line for 
subsequent studies. The process for Target Discovery is described in Jarvis et al, 
International PCT publication No. WO 98/50530, incorporated by reference herein 
in its entirety including the Figures. 

Example 3: Ribozyme Library Construction 

A ribozyme library was constructed with oligonucleotides containing ribozymes 
with two randomized regions comprising six-nucleotide binding "arms" (Stem I and 
Stem III of a ribozyme-substrate complex). Oligo sequence 5' and 3' of the 
ribozyme contains restriction endonuclease cleavage sites for cloning. The 3' 
trailing sequence forms a stem-loop for priming DNA polymerase extension to form 
a double-stranded molecule. The double-stranded ribozyme library was cloned into 
the U6+27 transcription unit located in the 5' LTR region of a retroviral vector 
containing the human nerve growth factor receptor (hNGFr) reporter gene. 
Positioning the U6+27/ribozyme transcription unit in the 5' LTR results in a 
duplication of the transcription unit when the vector integrates into the host cell 
genome. As a result, the ribozyme is transcribed by RNA polymerase III from 
U6+27 and by RNA polymerase II activity directed by the 5' LTR. The ribozyme 
library was packaged into retroviral particles that were used to infect and transduce 
H292 Clone 8 cells. Assay of the hNGFr reporter indicated that 50% to 60% of 
Clone 8 cells incorporated the ribozyme construct. Figures 5A and 5B describe the 
generalized scheme used in the ribozyme library construction and target discovery. 

By "randomized region" is meant a region of completely random sequence and/or 
partially random sequence. By completely random sequence is meant a sequence 
wherein theoretically there is equal representation of A, T, G and C nucleotides or 
modified derivatives thereof, at each position in the sequence. By partially random 
sequence is meant a sequence wherein there is an unequal representation of A, T, G 
and C nucleotides or modified derivatives thereof, at each position in the sequence. 
A partially random sequence can therefore have one or more positions of complete 
randomness and one or more positions with defined nucleotides. 
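
The definitions of completely and partially random sequence can be illustrated with a short sketch. It is not part of the described procedure; the pattern notation ('N' for a random position, any other letter for a defined nucleotide) and the function names are assumptions introduced only for illustration. The sketch also counts how many distinct arm pairs two fully randomized six-nucleotide arms can theoretically form.

```python
import random

NUCLEOTIDES = "ATGC"  # the four nucleotides named in the definition above
ARM_LENGTH = 6        # six-nucleotide binding arms (Stem I and Stem III)


def library_complexity(arm_length=ARM_LENGTH, arms=2):
    """Theoretical number of distinct arm combinations when every
    position of every arm is completely random."""
    return len(NUCLEOTIDES) ** (arm_length * arms)


def random_arm(pattern, rng):
    """Draw one arm from a pattern (hypothetical notation):
    'N' is a completely random position, any other letter is defined."""
    return "".join(rng.choice(NUCLEOTIDES) if c == "N" else c for c in pattern)


if __name__ == "__main__":
    rng = random.Random(0)                      # seeded only for repeatability
    print(library_complexity())                 # 4 ** 12 possible arm pairs
    print(random_arm("NNNNNN", rng))            # completely random arm
    print(random_arm("NNGTNN", rng))            # partially random arm
```

With two six-nucleotide arms the theoretical complexity is 4^12 = 16,777,216 arm pairs; a partially random pattern reduces this by a factor of four for every defined position.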

Example 4: Enriching for Non-responders to Mucin Induction 

Sorting of ribozyme library-containing cells was performed to enrich for cells 
that produce less GFP after treatment with PCM and PMA. Lower GFP production 
may be due to ribozyme action upon genes involved in the activation of the mucin 
promoter. Alternatively, ribozymes may directly target the mucin/GFP transcript 
resulting in reduced GFP expression. 

Cells were seeded at a density of 1 x 10^6 per 150 cm^2 style cell culture flask. 
After 72 hours the standard cell culture medium was replaced with medium without 
fetal bovine serum. After 24 hours of serum deprivation the cells were treated with 
serum-containing medium supplemented with PCM (to 40%) and PMA (to 50 nM) 
to induce GFP production via the mucin promoter. After 20 to 22 hours, cells were 
monitored for GFP level on a FACStar Plus cell sorter. 

Sorting was performed if 90% of ribozyme library cells from an unsorted 
control sample were induced to produce GFP above background levels. Two cell 
fractions were collected in each round of sorting. 

In the initial sort the M1 gate collected cells in luminescence channels 1 to 4.5: 
those cells with the lowest GFP signal (5% of the induced population). The M2 sort 
gate collected cells in luminescence channels 4.5 to 20: cells with low GFP signal 
(10% of the induced population). The M1 and M2 fractions together represented the 
15% of the induced population responding least to the GFP induction treatment. In 
order to assure that the diversity of the ribozyme library was represented, 2.3 x 10^6 
cells were collected in the M1 fraction and 4.6 x 10^6 cells were collected in the M2 
fraction. The M1 and M2 fractions were cultured separately and representative 
portions of each were cryopreserved after each round of sorting. 

When treated with PCM and PMA prior to a second round of sorting, cells 
from both the M1 and M2 fractions responded as before with >90% of the cells 
producing elevated levels of GFP. The same sorting criteria and sort gates were 
used in the second round. As in the first round of sorting the M1 sort gate collected 
5% of the treated cells (those with little or no GFP) and the M2 gate collected 10% 
of the cells. Two more rounds of sorting were performed using the same sorting 
criteria. 

Prior to the third round of sorting the M1 fraction showed a three-fold 
enrichment of GFP negative cells. Prior to the fourth round of sorting both the M1 
and M2 fractions were significantly enriched in cells unresponsive to the GFP 
induction treatment. 

Following the third round of sorting the M1 fraction was selected to generate a 
database of ribozymes present in the sorted cells. 
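
The sort gates and cell numbers given in this example can be expressed as a small sketch. It only restates the gate boundaries (channels 1 to 4.5 and 4.5 to 20) and the stated gate fractions (5% and 10%); the function names are hypothetical, and assigning a cell at exactly channel 4.5 to M2 is an assumption, since the text does not say which gate owns the boundary.

```python
M1_GATE = (1.0, 4.5)    # lowest GFP signal, about 5% of the induced population
M2_GATE = (4.5, 20.0)   # low GFP signal, about 10% of the induced population


def assign_gate(channel):
    """Return 'M1', 'M2' or None for a cell's GFP channel value.
    Assumption: the shared boundary (4.5) is assigned to M2."""
    if M1_GATE[0] <= channel < M1_GATE[1]:
        return "M1"
    if M2_GATE[0] <= channel <= M2_GATE[1]:
        return "M2"
    return None  # cell responded to induction and is not collected


def cells_to_sort(cells_collected, gate_fraction):
    """Induced cells that must pass the sorter to collect the given
    number of cells in a gate holding gate_fraction of the population."""
    return cells_collected / gate_fraction


if __name__ == "__main__":
    for channel in (2.0, 4.5, 15.0, 300.0):
        print(channel, assign_gate(channel))
    print(cells_to_sort(2.3e6, 0.05))   # M1 fraction
    print(cells_to_sort(4.6e6, 0.10))   # M2 fraction
```

Under these figures both fractions correspond to roughly 4.6 x 10^7 induced cells passing through the sorter, which is consistent with the two gates having been collected from the same sorted population.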

Example 5: Recovery of Ribozyme Sequence from Sorted Cells 

Genomic DNA was obtained from sorted ribozyme library cells by standard 
methods. Nested polymerase chain reaction (PCR) primers (Sequence ID Nos. 5468 
and 5469) that hybridized to the retroviral vector 5' and 3' of the ribozyme were 
used to recover and amplify the ribozyme sequences from the Clone 8 library cell 
DNA. The PCR product was ligated into a bacterial cloning vector. Two methods 
were developed to use the recovered ribozyme library, in plasmid form, to generate a 
database of ribozyme binding arm sequences. In the first approach the library was 
cloned into E. coli. DNA was prepared by plasmid isolation from bacterial colonies 
or by direct colony PCR and ribozyme arm sequence was determined. Over 450 
sequences have been obtained by this method. A second method used the ribozyme 
library to transfect H292 Clone 8 cells. Clonal lines of stably transfected cells were 
established and induced with PCM and PMA. Those lines which failed to respond 
to GFP induction were probed by PCR for single ribozyme integration events. Over 
300 sequences were obtained in this manner. The unique ribozyme sequences 
obtained by both methods were added to a Target Sequence Tag (TST) database. 

Example 6: Bioinformatics 

After sequencing 760 recovered ribozymes, 171 unique sequences were found. 
Of the unique sequences, 91 have been recovered once and 80 have been found 
multiple times. Most of the repeated sequences have been found 2 to 11 times. One 
sequence has been recovered 145 times. The diversity of the sequences obtained 
indicates that the sorted cells are a promising source of information leading to target 
discovery. 
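
A tally of this kind (total sequences, unique sequences, sequences seen once, sequences seen repeatedly) can be sketched as follows. This is an illustrative sketch only, not the software used by Applicant; the function name and the toy arm sequences are hypothetical.

```python
from collections import Counter


def summarize_recoveries(sequences):
    """Summarize how often each recovered binding-arm sequence was seen."""
    counts = Counter(sequences)
    once = sum(1 for n in counts.values() if n == 1)
    return {
        "total": len(sequences),
        "unique": len(counts),
        "recovered_once": once,
        "recovered_multiple": len(counts) - once,
        "most_frequent": counts.most_common(1)[0] if counts else None,
    }


if __name__ == "__main__":
    # Hypothetical toy data; real input would be the recovered arm sequences.
    toy = ["AGCATT", "AGCATT", "TTGACC", "GGATCA", "AGCATT", "TTGACC"]
    print(summarize_recoveries(toy))
```

Applied to the numbers reported above, such a summary would show 760 total and 171 unique sequences, with 91 recovered once and 80 recovered multiple times.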

Ribozyme binding arm sequences were compared to public and private gene 
data banks. Gene matches were compiled according to perfect and imperfect 
matches. Potential gene targets were categorized by the number of different 
ribozyme sequences matching each gene. Multiple ribozyme matches have been 
found for 180 genes. Genes with more than one perfect ribozyme match were given 
close attention. A total of 34 genes have been verified to date to have multiple 
perfect ribozyme matches. Of those at least 17 have protein products of known 
function. 

Two perfect ribozyme matches were found for human calcium activated 
chloride channel-1 (hCLCA1). Each ribozyme matches at two sites in the hCLCA1 
gene. A third sorted library ribozyme sequence "hits" hCLCA1 but has a single 
nucleotide mismatch. 
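
The distinction between perfect matches and single-mismatch "hits" can be sketched as a simple scan of a target RNA with a pair of ribozyme binding arms. This is a minimal sketch under stated assumptions, not the search actually run against the gene data banks: the function names are hypothetical, the arms are assumed to flank one unpaired cleavage-site nucleotide H (A, C or U) preceded by U, and the test sequence is the first substrate and ribozyme listed in Table III.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def revcomp(rna):
    """Reverse complement of an RNA string."""
    return rna.translate(COMPLEMENT)[::-1]


def find_sites(target, arm_5p, arm_3p, max_mismatches=1):
    """Scan target for sites bound by the ribozyme's 5' arm (pairs 3' of the
    cleavage nucleotide) and 3' arm (pairs 5' of it). Returns a list of
    (1-based position of the unpaired nucleotide H, mismatch count)."""
    upstream = revcomp(arm_3p)     # target sequence 5' of H
    downstream = revcomp(arm_5p)   # target sequence 3' of H
    expected = upstream + downstream
    site_len = len(expected) + 1
    hits = []
    for i in range(len(target) - site_len + 1):
        window = target[i:i + site_len]
        h_index = len(upstream)
        # Hammerhead requirement: U immediately 5' of H, and H is not G.
        if window[h_index - 1] != "U" or window[h_index] == "G":
            continue
        paired = window[:h_index] + window[h_index + 1:]
        mismatches = sum(a != b for a, b in zip(paired, expected))
        if mismatches <= max_mismatches:
            hits.append((i + h_index + 1, mismatches))
    return hits


if __name__ == "__main__":
    target = "GGG" + "CUAAUGCUUUUGGUACA" + "CCC"
    print(find_sites(target, "UGUACCAA", "AGCAUUAG"))
```

Setting max_mismatches to 0 reports only perfect matches; setting it to 1 also reports hits of the kind described for the third ribozyme sequence.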

Example 7: Selection of hCLCA1 for Validation 

The selection of hCLCA1 as a candidate for target validation was based on 
bioinformatics and on emerging data in murine models of mucous hypersecretion in 
the trachea and lung. Two ribozymes (Seq. ID Nos. 2332 and 2273) recovered from 
cells that no longer respond to mucin promoter/GFP induction match perfectly to 
hCLCA1. A third has a single mismatch. Evidence from two murine models 
indicates a correlation between mucous hypersecretion in the lung and strong 
upregulation of gob-5 (GenBank AB017156), a murine homologue of hCLCA1. 

Example 8: Validation of hCLCA1 

To validate hCLCA1 as a regulator of MUC5AC expression, GeneBloc 
reagents were designed (Table IX) to the hCLCA1 cDNA sequence (GenBank 
AF039400). GeneBloc reagents are complexed with a cationic lipid formulation 
prior to administration to H292/MUC5AC/GFP Clone 8 cells. Concentrations of the 
GeneBloc reagents administered range from 30 nM to 120 nM at cationic lipid 
concentrations of 4-6 ng/mL. Cells are treated with GeneBloc reagents for 72 to 96 
hours. Before the termination of GeneBloc treatment, PCM (to 40%) and PMA (to 
50 nM) are added to induce the MUC5AC promoter. After twenty hours of 
induction the cells are harvested and assayed for phenotypic and molecular 
parameters. Reduced GFP expression in GeneBloc treated cells (measured by flow 
cytometry) is taken as evidence for validation of hCLCA1. Knockdown of hCLCA1 
RNA in GeneBloc treated cells can correlate with reduced endogenous MUC5AC 
RNA and reduced GFP RNA (from the MUC5AC/GFP construct) to complete 
validation of hCLCA1. 

Example 9: Identification of Potential Target Sites in Human CLCA1 RNA 

The sequence of human CLCA1 is screened for accessible sites using a 
computer-folding algorithm. Regions of the RNA are identified that do not form 
secondary folding structures. These regions contain potential ribozyme and/or 
antisense binding/cleavage sites. The sequences of these binding/cleavage sites are 
shown in Tables III-IX. 

Example 10: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human CLCA1 
RNA 

Ribozyme target sites are chosen by analyzing sequences of Human CLCA1 
(GenBank accession numbers: NM_001285 and AF039400) and prioritizing the sites 
on the basis of folding. Ribozymes are designed that could bind each target and are 
individually analyzed by computer folding (Christoffersen et al, 1994 J. Mol Struc. 
Theochem, 311, 273; Jaeger et al, 1989, Proc. Natl. Acad. Sci. USA, 86, 7706) to 
assess whether the ribozyme sequences fold into the appropriate secondary structure. 
Those ribozymes with unfavorable intramolecular interactions between the binding 
arms and the catalytic core are eliminated from consideration. As noted below, 
varying binding arm lengths can be chosen to optimize activity. Generally, at least 5 
bases on each arm are able to bind to, or otherwise interact with, the target RNA. 
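
The site-selection step can be sketched for the hammerhead motif. The sketch below lists every UH site (H = A, C or U) in a target RNA and assembles a ribozyme in the layout used for the hammerhead entries of Table III: the reverse complement of the 3' flank, the catalytic core as printed in that table, then the reverse complement of the 5' flank. The function names and the default arm length of 8 (taken from the table) are assumptions, and the computer-folding filter described above is not implemented here.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")

# Catalytic core as printed in the Table III ribozyme column (spaces removed).
CORE = "CUGAUGAGGCCGUUAGGCCGAA"


def revcomp(rna):
    """Reverse complement of an RNA string."""
    return rna.translate(COMPLEMENT)[::-1]


def design_hammerheads(target, arm_len=8):
    """Return (1-based position of H, 5' flank, H, 3' flank, ribozyme)
    for every UH site with arm_len nucleotides available on both sides."""
    designs = []
    for i in range(arm_len, len(target) - arm_len):
        if target[i - 1] == "U" and target[i] in "ACU":
            upstream = target[i - arm_len:i]            # ends with the U of UH
            downstream = target[i + 1:i + 1 + arm_len]
            ribozyme = revcomp(downstream) + CORE + revcomp(upstream)
            designs.append((i + 1, upstream, target[i], downstream, ribozyme))
    return designs


if __name__ == "__main__":
    for design in design_hammerheads("CUAAUGCUUUUGGUACA"):
        print(design)
```

For the 17-nucleotide substrate of the first Table III row, the only candidate with full 8-nucleotide flanks reproduces the ribozyme sequence listed in that row. Candidates produced this way would still need the folding analysis described in this example before being retained.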

Example 11: Chemical Synthesis and Purification of Ribozymes and Antisense for 
Efficient Cleavage and/or blocking of CLCA1 RNA 

Ribozymes and antisense constructs are designed to anneal to various sites in 
the RNA message. The binding arms of the ribozymes are complementary to the 
target site sequences described above, while the antisense constructs are fully 
complementary to the target site sequences described above. The ribozymes and 
antisense constructs were chemically synthesized. The method of synthesis used 
followed the procedure for normal RNA synthesis as described above and in Usman 
et al, (1987 J. Am. Chem. Soc, 109, 7845), Scaringe et al, (1990 Nucleic Acids 
Res., 18, 5433) and Wincott et al, supra, and made use of common nucleic acid 
protecting and coupling groups, such as dimethoxytrityl at the 5'-end, and 
phosphoramidites at the 3'-end. The average stepwise coupling yields were typically 
>98%. 






Ribozymes and antisense constructs are also synthesized from DNA templates 
using bacteriophage T7 RNA polymerase (Milligan and Uhlenbeck, 1989, Methods 
Enzymol. 180, 51). Ribozymes and antisense constructs are purified by gel 
electrophoresis using general methods or are purified by high pressure liquid 
chromatography (HPLC; see Wincott et al., supra; the totality of which is hereby 
incorporated herein by reference) and are resuspended in water. The sequences of 
the chemically synthesized ribozymes and antisense constructs used in this study are 
shown below in Tables III-IX. 
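
The stepwise coupling yield quoted in this example determines how much full-length product a chemical synthesis can deliver. The following is a minimal sketch of that arithmetic, assuming that an n-nucleotide oligomer requires n - 1 couplings (the first nucleoside being supplied on the solid support) and that every step has the same yield; the function name is hypothetical.

```python
def full_length_fraction(length_nt, stepwise_yield=0.98):
    """Fraction of chains reaching full length after length_nt - 1 couplings,
    assuming an identical yield at every coupling step."""
    couplings = max(length_nt - 1, 0)
    return stepwise_yield ** couplings


if __name__ == "__main__":
    # A Table III hammerhead ribozyme is 38 nucleotides long (8 + 22 + 8).
    print(f"{full_length_fraction(38):.3f}")
```

At a stepwise yield of exactly 98%, a 38-nucleotide ribozyme would reach full length in roughly 47% of chains; because the reported yields were above 98%, this figure is a lower bound under the stated assumptions and illustrates why the products are purified by gel electrophoresis or HPLC.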

Indications 

Particular conditions and disease states that can be associated with CLCA1 
expression modulation include but are not limited to Chronic Obstructive Pulmonary 
Disease (COPD), chronic bronchitis, asthma, cystic fibrosis, obstructive bowel 
syndrome, and any other diseases or conditions that are related to or will respond to 
the levels of CLCA1 in a cell or tissue, alone or in combination with other therapies. 

The present body of knowledge in CLCA1 research indicates the need for 
methods to assay CLCA1 activity and for compounds that can regulate CLCA1 
expression for research, diagnostic, and therapeutic use. 

The nucleic acid molecules of the present invention may also be administered to 
a patient in combination with other therapeutic compounds to increase the overall 
therapeutic effect. The use of multiple compounds to treat an indication may 
increase the beneficial effects while reducing the presence of side effects. Oxygen 
therapy, bronchodilators, corticosteroids, antibacterials, vaccinations, acetylcysteine, 
mucokinetic agents, and DNase (Pulmozyme), are non-limiting examples of methods 
and/or treatments that can be used in combination with nucleic acid molecules of the 
invention. Those skilled in the art will recognize that other drug compounds and 
therapies can be similarly and readily combined with the nucleic acid molecules of 
the instant invention (e.g. ribozymes and antisense molecules) and are, therefore, 
within the scope of the instant invention. 

Cell Culture 

The cell culture system described in Example 8 can be used to evaluate nucleic 
acid molecules of the invention for efficacy in CLCA1 and mucin modulation. 

Animal Models 

Numerous reports can be found which describe animal models relevant to 
disease states such as COPD and cystic fibrosis. These models can be used to 
determine efficacy of the nucleic acid molecules of the instant invention targeting 
such disease states or conditions. Animal models for chronic obstructive pulmonary 
disease (COPD) are described by Shapiro, 2000, Am. J. Respir. Cell Mol. Biol., 22(1), 4-7; 
Hogg, 1998, Ika Daigaku Zasshi, 56(3), 429-432; and Garssen et al, 1997, 
Inhalation Toxicol, 9(6), 581-599. Animal models for cystic fibrosis are described 
by Kent et al, 1997, J. Clin. Invest, 100(12), 3060-3069; Hill et al, 1997, 62(1), 
113-122; Grubb and Gabriel, 1997, Am. J. Physiol, 272, G258-G266; Rozmahel, 
1996, From: Diss. Abstr. Int. B 1997, 57(8), 4863; Van Doorninck et al, 1995, 
EMBO J., 14(18), 4403-11; and Zeiher et al, 1995, J. Clin. Invest, 96(4), 2051-64. 

Diagnostic uses 

The nucleic acid molecules of this invention (e.g., ribozymes) may be used as 
diagnostic tools to examine genetic drift and mutations within diseased cells or to 
detect the presence of CLCA1 RNA in a cell. The close relationship between 
ribozyme activity and the structure of the target RNA allows the detection of 
mutations in any region of the molecule which alter the base-pairing and three- 
dimensional structure of the target RNA. By using multiple ribozymes described in 
this invention, one may map nucleotide changes which are important to RNA 
structure and function in vitro, as well as in cells and tissues. Cleavage of target 
RNAs with ribozymes may be used to inhibit gene expression and define the role 
(essentially) of specified gene products in the progression of disease. In this manner, 
other genetic targets may be defined as important mediators of the disease. These 
experiments will lead to better treatment of the disease progression by affording the 
possibility of combinational therapies (e.g., multiple ribozymes targeted to different 
genes, ribozymes coupled with known small molecule inhibitors, or intermittent 
treatment with combinations of ribozymes and/or other chemical or biological 
molecules). Other in vitro uses of ribozymes of this invention are well known in the 
art, and include detection of the presence of mRNAs associated with a CLCA1-related 
condition. Such RNA is detected by determining the presence of a cleavage product 
after treatment with a ribozyme using standard methodology. 

In a specific example, ribozymes which can cleave only wild-type or mutant 
forms of the target RNA are used for the assay. The first ribozyme is used to 
identify wild-type RNA present in the sample and the second ribozyme will be used 
to identify mutant RNA in the sample. As reaction controls, synthetic substrates of 
both wild-type and mutant RNA will be cleaved by both ribozymes to demonstrate 
the relative ribozyme efficiencies in the reactions and the absence of cleavage of the 
"non-targeted" RNA species. The cleavage products from the synthetic substrates 
will also serve to generate size markers for the analysis of wild-type and mutant 
RNAs in the sample population. Thus, each analysis can require two ribozymes, two 
substrates and one unknown sample, which will be combined into six reactions. The 
presence of cleavage products will be determined using an RNAse protection assay 
so that full-length and cleavage fragments of each RNA can be analyzed in one lane 
of a polyacrylamide gel. It is not absolutely required to quantify the results to gain 
insight into the expression of mutant RNAs and putative risk of the desired 
phenotypic changes in target cells. The expression of mRNA whose protein product 
is implicated in the development of the phenotype (i.e., CLCA1) is adequate to 
establish risk. If probes of comparable specific activity are used for both transcripts, 
then a qualitative comparison of RNA levels will be adequate and will decrease the 
cost of the initial diagnosis. Higher mutant form to wild-type ratios will be 
correlated with higher risk whether RNA levels are compared qualitatively or 
quantitatively. 
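
The count of six reactions follows from pairing each of the two ribozymes with each of the three RNA inputs (two synthetic substrates and the unknown sample). A brief sketch of that enumeration is given below; the labels and the function name are descriptive placeholders introduced for illustration, not names used elsewhere in this disclosure.

```python
from itertools import product

RIBOZYMES = ["wild-type-specific ribozyme", "mutant-specific ribozyme"]
RNA_INPUTS = [
    "synthetic wild-type substrate",   # reaction control and size marker
    "synthetic mutant substrate",      # reaction control and size marker
    "unknown sample",
]


def reaction_plan():
    """Every ribozyme is incubated separately with every RNA input."""
    return list(product(RIBOZYMES, RNA_INPUTS))


if __name__ == "__main__":
    for number, (ribozyme, rna) in enumerate(reaction_plan(), start=1):
        print(f"Reaction {number}: {ribozyme} + {rna}")
```

Four of the six reactions are controls on synthetic substrates, showing cleavage of the targeted species and absence of cleavage of the non-targeted species; the remaining two interrogate the unknown sample.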

Additional Uses 

Sequence-specific enzymatic nucleic acid molecules of the instant invention 
might have many of the same applications for the study of RNA 
that DNA restriction endonucleases have for the study of DNA (Nathans et al., 1975 
Ann. Rev. Biochem. 44:273). For example, the pattern of restriction fragments could 
be used to establish sequence relationships between two related RNAs, and large 
RNAs could be specifically cleaved to fragments of a size more useful for study. The 
ability to engineer sequence specificity of the enzymatic nucleic acid molecule is 
ideal for cleavage of RNAs of unknown sequence. Applicant describes the use of 
nucleic acid molecules to down-regulate gene expression of target genes in bacterial, 
microbial, fungal, viral, and eukaryotic systems including plant, or mammalian cells. 

All patents and publications mentioned in the specification are indicative of the 
levels of skill of those skilled in the art to which the invention pertains. All 
references cited in this disclosure are incorporated by reference to the same extent as 
if each reference had been incorporated by reference in its entirety individually. 

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as 
well as those inherent therein. The methods and compositions described herein as 
presently representative of preferred embodiments are exemplary and are not 
intended as limitations on the scope of the invention. Changes therein and other 
uses will occur to those skilled in the art, which are encompassed within the spirit of 
the invention and are defined by the scope of the claims. 

It will be readily apparent to one skilled in the art that varying substitutions and 
modifications may be made to the invention disclosed herein without departing from 
the scope and spirit of the invention. Thus, such additional embodiments are within 
the scope of the present invention and the following claims. 

The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which is not 
specifically disclosed herein. Thus, for example, in each instance herein any of the 
terms "comprising", "consisting essentially of" and "consisting of" may be replaced 
with either of the other two terms. The terms and expressions which have been 
employed are used as terms of description and not of limitation, and there is no 
intention that in the use of such terms and expressions of excluding any equivalents 
of the features shown and described or portions thereof, but it is recognized that 
various modifications are possible within the scope of the invention claimed. Thus, 
it should be understood that although the present invention has been specifically 
disclosed by preferred embodiments, optional features, modification and variation of 
the concepts herein disclosed may be resorted to by those skilled in the art, and that 
such modifications and variations are considered to be within the scope of this 
invention as defined by the description and the appended claims. 

In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will 
recognize that the invention is also thereby described in terms of any individual 
member or subgroup of members of the Markush group or other group. 



Other embodiments are within the following claims. 
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TABLE I 

Characteristics of naturally occurring ribozymes 
Group I Introns 

Size: -150 to >1000 nucleotides. 

• Requires a U in the target sequence immediately 5' of the cleavage site. 

• Binds 4-6 nucleotides at the S'-side of the cleavage site. 

• Reaction mechanism: attack by the 3' -OH of guanosine to generate cleavage 
products with 3' -OH and 5'-guanosine. 

• Additional protein cofactors required in some cases to help folding and 
maintainance of the active structure. 

• Over 300 known members of this class. Found as an intervening sequence in 
Tetrahymena thermophila rRNA, fungal mitochondria, chloroplasts, phage T4, 
blue-green algae, and others. 

• Major structural features largely established through phylogenetic 
comparisons, mutagenesis, and biochemical studies [*,"]. 

• Complete kinetic framework established for one ribozyme [ m , iv , v , vi ]. 

• Studies of ribozyme folding and substrate docking underway [ vii , vii y x ]. 

• Chemical modification investigation of important residues well established 
[V 1 ]. 

• The small (4-6 nt) binding site may make this ribozyme too non-specific for 
targeted RNA cleavage, however, the Tetrahymena group I intron has been 
used to repair a "defective" D-galactosidase message by the ligation of new □- 
galactosidase sequences onto the defective message [ xii ]. 

RNAse P RNA (Ml RNA) 



Size: -290 to 400 nucleotides. 

• RNA portion of a ubiquitous ribonucleoprotein enzyme. 

• Cleaves tRNA precursors to form mature tRNA [ xiii ]. 

• Reaction mechanism: possible attack by M -OH to generate cleavage products 
with 3' -OH and 5' -phosphate. 

• RNAse P is found throughout the prokaryotes and eukaryotes. The RNA 
subunit has been sequenced from bacteria, yeast, rodents, and primates. 

• Recruitment of endogenous RNAse P for therapeutic applications is possible 
through hybridization of an External Guide Sequence (EGS) to the target RNA 

Jxiv^xvJ 

• Important phosphate and 2' OH contacts recently identified [xvi/vii] 
Group II Introns 



Size: >1000 nucleotides. 

Trans cleavage of target RNAs recently demonstrated [xvm /X ix] 




Sequence requirements not fully determined. 

Reaction mechanism: Z-OH of an internal adenosine generates cleavage 
products with 3' -OH and a "lariat" RNA containing a 3' -5' and a 2' -5' branch 
point. 

Only natural ribozyme with demonstrated participation in DNA cleavage 
[xx xxi] j n addition to RNA cleavage and ligation. 

Major structural features largely established through phylogenetic comparisons 

JxxiiJ 

Important 2' OH contacts beginning to be identified [ xxiU ] 
Kinetic framework under development [ xxiv ] 



Neurospora VS RNA 

Size: -144 nucleotides. 

Trans cleavage of hairpin target RNAs recently demonstrated [ xxv ]. 
Sequence requirements not fully determined. 

Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate 
cleavage products with 2',3'-cyclic phosphate and 5'-OH ends. 
Binding sites and structural requirements not fully determined. 
Only 1 known member of this class. Found in Neurospora VS RNA. 

Hammerhead Ribozyme 

(see text for references) 

• Size: -13 to 40 nucleotides. 

• Requires the target sequence UH immediately 5' of the cleavage site. 

• Binds a variable number nucleotides on both sides of the cleavage site. 

• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate 
cleavage products with 2 , ,3 , -cyclic phosphate and 5'-OH ends. 

• 14 known members of this class. Found in a number of plant pathogens 
(virusoids) that use RNA as the infectious agent. 

• Essential structural features largely defined, including 2 crystal structures 

Jxxvi^xxviiJ 

• Minimal ligation activity demonstrated (for engineering through in vitro 
selection) [xxviii] 

• Complete kinetic framework established for two or more ribozymes [ xxix ]. 

• Chemical modification investigation of important residues well established 
[ xxx ] . 




Hairpin Ribozyme 

• Size: -50 nucleotides. 

• Requires the target sequence GUC immediately 3' of the cleavage site. 

• Binds 4-6 nucleotides at the S'-side of the cleavage site and a variable number to 
the 3'-side of the cleavage site. 

• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate 
cleavage products with 2',3'-cyclic phosphate and 5'-OH ends. 
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• 3 known members of this class. Found in three plant pathogen (satellite RNAs 
of the tobacco ringspot virus, arabis mosaic virus and chicory yellow mottle 
virus) which uses RNA as the infectious agent. 

• Essential structural features largely defined [xxxi^xxii xxxiii^xxiv] 

• Ligation activity (in addition to cleavage activity) makes ribozyme amenable to 
engineering through in vitro selection [ xx * v ] 

• Complete kinetic framework established for one ribozyme [xxxvi] 

• Chemical modification investigation of important residues begun [xxxvu xxxviii] 

Hepatitis Delta Virus (HDV) Ribozyme 

• Size: ~60 nucleotides. 

• Trans cleavage of target RNAs demonstrated [xxxix] 

• Binding sites and structural requirements not fully determined, although no 
sequences 5' of cleavage site are required. Folded ribozyme contains a 
pseudoknot structure [xl]. 

• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate 
cleavage products with 2',3'-cyclic phosphate and 5'-OH ends. 

• Only 2 known members of this class. Found in human HDV. 

• Circular form of HDV is active and shows increased nuclease stability [ xli ] 

Table III: Human CLCA1 Hammerhead Ribozyme and Target Sequence 



Pos  Substrate  Seq ID No.  Ribozyme  Rz Seq ID No. 


11  CUAAUGCU U UUGGUACA  1  UGUACCAA CUGAUGAG GCCGUUAGGC CGAA AGCAUUAG  2190 
12  UAAUGCUU U UGGUACAA  2  UUGUACCA CUGAUGAG GCCGUUAGGC CGAA AAGCAUUA  2191 
13  AAUGCUUU U GGUACAAA  3  UUUGUACC CUGAUGAG GCCGUUAGGC CGAA AAAGCAUU  2192 
17  CUUUUGGU A CAAAUGGA  4  UCCAUUUG CUGAUGAG GCCGUUAGGC CGAA ACCAAAAG  2193 
34  UGUGGAAU A UAAUUGAA  5  UUCAAUUA CUGAUGAG GCCGUUAGGC CGAA AUUCCACA  2194 
36  UGGAAUAU A AUUGAAUA  6  UAUUCAAU CUGAUGAG GCCGUUAGGC CGAA AUAUUCCA  2195 
39  AAUAUAAU U GAAUAUUU  7  AAAUAUUC CUGAUGAG GCCGUUAGGC CGAA AUUAUAUU  2196 
44  AAUUGAAU A UUUUCUUG  8  CAAGAAAA CUGAUGAG GCCGUUAGGC CGAA AUUCAAUU  2197 
46  UUGAAUAU U UUCUUGUU  9  AACAAGAA CUGAUGAG GCCGUUAGGC CGAA AUAUUCAA  2198 
47  UGAAUAUU U UCUUGUUU  10  AAACAAGA CUGAUGAG GCCGUUAGGC CGAA AAUAUUCA  2199 
48  GAAUAUUU U CUUGUUUA  11  UAAACAAG CUGAUGAG GCCGUUAGGC CGAA AAAUAUUC  2200 
49  AAUAUUUU C UUGUUUAA  12  UUAAACAA CUGAUGAG GCCGUUAGGC CGAA AAAAUAUU  2201 
51  UAUUUUCU U GUUUAAGG  13  CCUUAAAC CUGAUGAG GCCGUUAGGC CGAA AGAAAAUA  2202 
54  UUUCUUGU U UAAGGGGA  14  UCCCCUUA CUGAUGAG GCCGUUAGGC CGAA ACAAGAAA  2203 
55  UUCUUGUU U AAGGGGAG  15  CUCCCCUU CUGAUGAG GCCGUUAGGC CGAA AACAAGAA  2204 
56  UCUUGUUU A AGGGGAGC  16  GCUCCCCU CUGAUGAG GCCGUUAGGC CGAA AAACAAGA  2205 
77  AGAGGUGU U GAGGUUAU  17  AUAACCUC CUGAUGAG GCCGUUAGGC CGAA ACACCUCU  2206 
83  GUUGAGGU U AUGUCAAG  18  CUUGACAU CUGAUGAG GCCGUUAGGC CGAA ACCUCAAC  2207 
84  UUGAGGUU A UGUCAAGC  19  GCUUGACA CUGAUGAG GCCGUUAGGC CGAA AACCUCAA  2208 
88  GGUUAUGU C AAGCAUCU  20  AGAUGCUU CUGAUGAG GCCGUUAGGC CGAA ACAUAACC  2209 
95  UCAAGCAU C UGGCACAG  21  CUGUGCCA CUGAUGAG GCCGUUAGGC CGAA AUGCUUGA  2210 
122  AUGGAAAU A UUUACAAG  22  CUUGUAAA CUGAUGAG GCCGUUAGGC CGAA AUUUCCAU  2211 
124  GGAAAUAU U UACAAGUA  23  UACUUGUA CUGAUGAG GCCGUUAGGC CGAA AUAUUUCC  2212 
125  GAAAUAUU U ACAAGUAC  24  GUACUUGU CUGAUGAG GCCGUUAGGC CGAA AAUAUUUC  2213 
126  AAAUAUUU A CAAGUACG  25  CGUACUUG CUGAUGAG GCCGUUAGGC CGAA AAAUAUUU  2214 
132  UUACAAGU A CGCAAUUU  26  AAAUUGCG CUGAUGAG GCCGUUAGGC CGAA ACUUGUAA  2215 
139  UACGCAAU U UGAGACUA  27  UAGUCUCA CUGAUGAG GCCGUUAGGC CGAA AUUGCGUA  2216 
140  ACGCAAUU U GAGACUAA  28  UUAGUCUC CUGAUGAG GCCGUUAGGC CGAA AAUUGCGU  2217 
147  UUGAGACU A AGAUAUUG  29  CAAUAUCU CUGAUGAG GCCGUUAGGC CGAA AGUCUCAA  2218 
152  ACUAAGAU A UUGUUAUC  30  GAUAACAA CUGAUGAG GCCGUUAGGC CGAA AUCUUAGU  2219 
154  UAAGAUAU U GUUAUCAU  31  AUGAUAAC CUGAUGAG GCCGUUAGGC CGAA AUAUCUUA  2220 
157  GAUAUUGU U AUCAUUCU  32  AGAAUGAU CUGAUGAG GCCGUUAGGC CGAA ACAAUAUC  2221 
158  AUAUUGUU A UCAUUCUC  33  GAGAAUGA CUGAUGAG GCCGUUAGGC CGAA AACAAUAU  2222 
160  AUUGUUAU C AUUCUCCU  34  AGGAGAAU CUGAUGAG GCCGUUAGGC CGAA AUAACAAU  2223 
163  GUUAUCAU U CUCCUAUU  35  AAUAGGAG CUGAUGAG GCCGUUAGGC CGAA AUGAUAAC  2224 
164  UUAUCAUU C UCCUAUUG  36  CAAUAGGA CUGAUGAG GCCGUUAGGC CGAA AAUGAUAA  2225 
166  AUCAUUCU C CUAUUGAA  37  UUCAAUAG CUGAUGAG GCCGUUAGGC CGAA AGAAUGAU  2226 
169  AUUCUCCU A UUGAAGAC  38  GUCUUCAA CUGAUGAG GCCGUUAGGC CGAA AGGAGAAU  2227 
171  UCUCCUAU U GAAGACAA  39  UUGUCUUC CUGAUGAG GCCGUUAGGC CGAA AUAGGAGA  2228 
187  AGAGCAAU A GUAAAACA  40  UGUUUUAC CUGAUGAG GCCGUUAGGC CGAA AUUGCUCU  2229 
190  GCAAUAGU A AAACACAU  41  AUGUGUUU CUGAUGAG GCCGUUAGGC CGAA ACUAUUGC  2230 
199  AAACACAU C AGGUCAGG  42  CCUGACCU CUGAUGAG GCCGUUAGGC CGAA AUGUGUUU  2231 
204  CAUCAGGU C AGGGGGUU  43  AACCCCCU CUGAUGAG GCCGUUAGGC CGAA ACCUGAUG  2232 
212  CAGGGGGU U AAAGACCU  44  AGGUCUUU CUGAUGAG GCCGUUAGGC CGAA ACCCCCUG  2233 
213  AGGGGGUU A AAGACCUG  45  CAGGUCUU CUGAUGAG GCCGUUAGGC CGAA AACCCCCU  2234 
226  CCUGUGAU A AACCACUU  46  AAGUGGUU CUGAUGAG GCCGUUAGGC CGAA AUCACAGG  2235 
234  AAACCACU U CCGAUAAG  47  CUUAUCGG CUGAUGAG GCCGUUAGGC CGAA AGUGGUUU  2236 
235  AACCACUU C CGAUAAGU  48  ACUUAUCG CUGAUGAG GCCGUUAGGC CGAA AAGUGGUU  2237 
240  CUUCCGAU A AGUUGGAA  49  UUCCAACU CUGAUGAG GCCGUUAGGC CGAA AUCGGAAG  2238 
244  CGAUAAGU U GGAAACGU  50  ACGUUUCC CUGAUGAG GCCGUUAGGC CGAA ACUUAUCG  2239 
257  ACGUGUGU C UAUAUUUU  51  AAAAUAUA CUGAUGAG GCCGUUAGGC CGAA ACACACGU  2240 
259  GUGUGUCU A UAUUUUCA  52  UGAAAAUA CUGAUGAG GCCGUUAGGC CGAA AGACACAC  2241 






261  GUGUCUAU A UUUUCAUA  53  UAUGAAAA CUGAUGAG GCCGUUAGGC CGAA AUAGACAC  2242 
263  GUCUAUAU U UUCAUAUC  54  GAUAUGAA CUGAUGAG GCCGUUAGGC CGAA AUAUAGAC  2243 
264  UCUAUAUU U UCAUAUCU  55  AGAUAUGA CUGAUGAG GCCGUUAGGC CGAA AAUAUAGA  2244 
265  CUAUAUUU U CAUAUCUG  56  CAGAUAUG CUGAUGAG GCCGUUAGGC CGAA AAAUAUAG  2245 
266  UAUAUUUU C AUAUCUGU  57  ACAGAUAU CUGAUGAG GCCGUUAGGC CGAA AAAAUAUA  2246 
269  AUUUUCAU A UCUGUAUA  58  UAUACAGA CUGAUGAG GCCGUUAGGC CGAA AUGAAAAU  2247 
271  UUUCAUAU C UGUAUAUA  59  UAUAUACA CUGAUGAG GCCGUUAGGC CGAA AUAUGAAA  2248 
275  AUAUCUGU A UAUAUAUA  60  UAUAUAUA CUGAUGAG GCCGUUAGGC CGAA ACAGAUAU  2249 
277  AUCUGUAU A UAUAUAAU  61  AUUAUAUA CUGAUGAG GCCGUUAGGC CGAA AUACAGAU  2250 
279  CUGUAUAU A UAUAAUGG  62  CCAUUAUA CUGAUGAG GCCGUUAGGC CGAA AUAUACAG  2251 
281  GUAUAUAU A UAAUGGUA  63  UACCAUUA CUGAUGAG GCCGUUAGGC CGAA AUAUAUAC  2252 
283  AUAUAUAU A AUGGUAAA  64  UUUACCAU CUGAUGAG GCCGUUAGGC CGAA AUAUAUAU  2253 
289  AUAAUGGU A AAGAAAGA  65  UCUUUCUU CUGAUGAG GCCGUUAGGC CGAA ACCAUUAU  2254 
303  AGACACCU U CGUAACCC  66  GGGUUACG CUGAUGAG GCCGUUAGGC CGAA AGGUGUCU  2255 
304  GACACCUU C GUAACCCG  67  CGGGUUAC CUGAUGAG GCCGUUAGGC CGAA AAGGUGUC  2256 
307  ACCUUCGU A ACCCGCAU  68  AUGCGGGU CUGAUGAG GCCGUUAGGC CGAA ACGAAGGU  2257 
316  ACCCGCAU U UUCCAAAG  69  CUUUGGAA CUGAUGAG GCCGUUAGGC CGAA AUGCGGGU  2258 
317  CCCGCAUU U UCCAAAGA  70  UCUUUGGA CUGAUGAG GCCGUUAGGC CGAA AAUGCGGG  2259 
318  CCGCAUUU U CCAAAGAG  71  CUCUUUGG CUGAUGAG GCCGUUAGGC CGAA AAAUGCGG  2260 
319  CGCAUUUU C CAAAGAGA  72  UCUCUUUG CUGAUGAG GCCGUUAGGC CGAA AAAAUGCG  2261 
333  AGAGGAAU C ACAGGGAG  73  CUCCCUGU CUGAUGAG GCCGUUAGGC CGAA AUUCCUCU  2262 
346  GGAGAUGU A CAGCAAUG  74  CAUUGCUG CUGAUGAG GCCGUUAGGC CGAA ACAUCUCC  2263 
362  GGGGCCAU U UAAGAGUU  75  AACUCUUA CUGAUGAG GCCGUUAGGC CGAA AUGGCCCC  2264 
363  GGGCCAUU U AAGAGUUC  76  GAACUCUU CUGAUGAG GCCGUUAGGC CGAA AAUGGCCC  2265 
364  GGCCAUUU A AGAGUUCU  77  AGAACUCU CUGAUGAG GCCGUUAGGC CGAA AAAUGGCC  2266 
370  UUAAGAGU U CUGUGUUC  78  GAACACAG CUGAUGAG GCCGUUAGGC CGAA ACUCUUAA  2267 
371  UAAGAGUU C UGUGUUCA  79  UGAACACA CUGAUGAG GCCGUUAGGC CGAA AACUCUUA  2268 
377  UUCUGUGU U CAUCUUGA  80  UCAAGAUG CUGAUGAG GCCGUUAGGC CGAA ACACAGAA  2269 
378  UCUGUGUU C AUCUUGAU  81  AUCAAGAU CUGAUGAG GCCGUUAGGC CGAA AACACAGA  2270 
381  GUGUUCAU C UUGAUUCU  82  AGAAUCAA CUGAUGAG GCCGUUAGGC CGAA AUGAACAC  2271 
383  GUUCAUCU U GAUUCUUC  83  GAAGAAUC CUGAUGAG GCCGUUAGGC CGAA AGAUGAAC  2272 
387  AUCUUGAU U CUUCACCU  84  AGGUGAAG CUGAUGAG GCCGUUAGGC CGAA AUCAAGAU  2273 
388  UCUUGAUU C UUCACCUU  85  AAGGUGAA CUGAUGAG GCCGUUAGGC CGAA AAUCAAGA  2274 
390  UUGAUUCU U CACCUUCU  86  AGAAGGUG CUGAUGAG GCCGUUAGGC CGAA AGAAUCAA  2275 
391  UGAUUCUU C ACCUUCUA  87  UAGAAGGU CUGAUGAG GCCGUUAGGC CGAA AAGAAUCA  2276 
396  CUUCACCU U CUAGAAGG  88  CCUUCUAG CUGAUGAG GCCGUUAGGC CGAA AGGUGAAG  2277 
397  UUCACCUU C UAGAAGGG  89  CCCUUCUA CUGAUGAG GCCGUUAGGC CGAA AAGGUGAA  2278 
399  CACCUUCU A GAAGGGGC  90  GCCCCUUC CUGAUGAG GCCGUUAGGC CGAA AGAAGGUG  2279 
415  CCCUGAGU A AUUCACUC  91  GAGUGAAU CUGAUGAG GCCGUUAGGC CGAA ACUCAGGG  2280 
418  UGAGUAAU U CACUCAUU  92  AAUGAGUG CUGAUGAG GCCGUUAGGC CGAA AUUACUCA  2281 
419  GAGUAAUU C ACUCAUUC  93  GAAUGAGU CUGAUGAG GCCGUUAGGC CGAA AAUUACUC  2282 
423  AAUUCACU C AUUCAGCU  94  AGCUGAAU CUGAUGAG GCCGUUAGGC CGAA AGUGAAUU  2283 
426  UCACUCAU U CAGCUGAA  95  UUCAGCUG CUGAUGAG GCCGUUAGGC CGAA AUGAGUGA  2284 
427  CACUCAUU C AGCUGAAC  96  GUUCAGCU CUGAUGAG GCCGUUAGGC CGAA AAUGAGUG  2285 
446  CAAUGGCU A UGAAGGCA  97  UGCCUUCA CUGAUGAG GCCGUUAGGC CGAA AGCCAUUG  2286 
456  GAAGGCAU U GUCGUUGC  98  GCAACGAC CUGAUGAG GCCGUUAGGC CGAA AUGCCUUC  2287 
459  GGCAUUGU C GUUGCAAU  99  AUUGCAAC CUGAUGAG GCCGUUAGGC CGAA ACAAUGCC  2288 
462  AUUGUCGU U GCAAUCGA  100  UCGAUUGC CUGAUGAG GCCGUUAGGC CGAA ACGACAAU  2289 
468  GUUGCAAU C GACCCCAA  101  UUGGGGUC CUGAUGAG GCCGUUAGGC CGAA AUUGCAAC  2290 
498  GAAACACU C AUUCAACA  102  UGUUGAAU CUGAUGAG GCCGUUAGGC CGAA AGUGUUUC  2291 
501  ACACUCAU U CAACAAAU  103  AUUUGUUG CUGAUGAG GCCGUUAGGC CGAA AUGAGUGU  2292 
502  CACUCAUU C AACAAAUA  104  UAUUUGUU CUGAUGAG GCCGUUAGGC CGAA AAUGAGUG  2293 
510  CAACAAAU A AAGGACAU  105  AUGUCCUU CUGAUGAG GCCGUUAGGC CGAA AUUUGUUG  2294 
533  CCAGGCAU C UCUGUAUC  106  GAUACAGA CUGAUGAG GCCGUUAGGC CGAA AUGCCUGG  2295 
535  AGGCAUCU C UGUAUCUG  107  CAGAUACA CUGAUGAG GCCGUUAGGC CGAA AGAUGCCU  2296 
539  AUCUCUGU A UCUGUUUG  108  CAAACAGA CUGAUGAG GCCGUUAGGC CGAA ACAGAGAU  2297 






541  CUCUGUAU C UGUUUGAA  109  UUCAAACA CUGAUGAG GCCGUUAGGC CGAA AUACAGAG  2298 
545  GUAUCUGU U UGAAGCUA  110  UAGCUUCA CUGAUGAG GCCGUUAGGC CGAA ACAGAUAC  2299 
546  UAUCUGUU U GAAGCUAC  111  GUAGCUUC CUGAUGAG GCCGUUAGGC CGAA AACAGAUA  2300 
553  UUGAAGCU A CAGGAAAG  112  CUUUCCUG CUGAUGAG GCCGUUAGGC CGAA AGCUUCAA  2301 
566  AAAGCGAU U UUAUUUCA  113  UGAAAUAA CUGAUGAG GCCGUUAGGC CGAA AUCGCUUU  2302 
567  AAGCGAUU U UAUUUCAA  114  UUGAAAUA CUGAUGAG GCCGUUAGGC CGAA AAUCGCUU  2303 
568  AGCGAUUU U AUUUCAAA  115  UUUGAAAU CUGAUGAG GCCGUUAGGC CGAA AAAUCGCU  2304 
569  GCGAUUUU A UUUCAAAA  116  UUUUGAAA CUGAUGAG GCCGUUAGGC CGAA AAAAUCGC  2305 
571  GAUUUUAU U UCAAAAAU  117  AUUUUUGA CUGAUGAG GCCGUUAGGC CGAA AUAAAAUC  2306 
572  AUUUUAUU U CAAAAAUG  118  CAUUUUUG CUGAUGAG GCCGUUAGGC CGAA AAUAAAAU  2307 
573  UUUUAUUU C AAAAAUGU  119  ACAUUUUU CUGAUGAG GCCGUUAGGC CGAA AAAUAAAA  2308 
582  AAAAAUGU U GCCAUUUU  120  AAAAUGGC CUGAUGAG GCCGUUAGGC CGAA ACAUUUUU  2309 
588  GUUGCCAU U UUGAUUCC  121  GGAAUCAA CUGAUGAG GCCGUUAGGC CGAA AUGGCAAC  2310 
589  UUGCCAUU U UGAUUCCU  122  AGGAAUCA CUGAUGAG GCCGUUAGGC CGAA AAUGGCAA  2311 
590  UGCCAUUU U GAUUCCUG  123  CAGGAAUC CUGAUGAG GCCGUUAGGC CGAA AAAUGGCA  2312 
594  AUUUUGAU U CCUGAAAC  124  GUUUCAGG CUGAUGAG GCCGUUAGGC CGAA AUCAAAAU  2313 
595  UUUUGAUU C CUGAAACA  125  UGUUUCAG CUGAUGAG GCCGUUAGGC CGAA AAUCAAAA  2314 
623  GGCUGACU A UGUGAGAC  126  GUCUCACA CUGAUGAG GCCGUUAGGC CGAA AGUCAGCC  2315 
639  CCAAAACU U GAGACCUA  127  UAGGUCUC CUGAUGAG GCCGUUAGGC CGAA AGUUUUGG  2316 
647  UGAGACCU A CAAAAAUG  128  CAUUUUUG CUGAUGAG GCCGUUAGGC CGAA AGGUCUCA  2317 
663  GCUGAUGU U CUGGUUGC  129  GCAACCAG CUGAUGAG GCCGUUAGGC CGAA ACAUCAGC  2318 
664  CUGAUGUU C UGGUUGCU  130  AGCAACCA CUGAUGAG GCCGUUAGGC CGAA AACAUCAG  2319 
669  GUUCUGGU U GCUGAGUC  131  GACUCAGC CUGAUGAG GCCGUUAGGC CGAA ACCAGAAC  2320 
677  UGCUGAGU C UACUCCUC  132  GAGGAGUA CUGAUGAG GCCGUUAGGC CGAA ACUCAGCA  2321 
679  CUGAGUCU A CUCCUCCA  133  UGGAGGAG CUGAUGAG GCCGUUAGGC CGAA AGACUCAG  2322 
682  AGUCUACU C CUCCAGGU  134  ACCUGGAG CUGAUGAG GCCGUUAGGC CGAA AGUAGACU  2323 
685  CUACUCCU C CAGGUAAU  135  AUUACCUG CUGAUGAG GCCGUUAGGC CGAA AGGAGUAG  2324 
691  CUCCAGGU A AUGAUGAA  136  UUCAUCAU CUGAUGAG GCCGUUAGGC CGAA ACCUGGAG  2325 
704  UGAACCCU A CACUGAGC  137  GCUCAGUG CUGAUGAG GCCGUUAGGC CGAA AGGGUUCA  2326 
747  GAAAGGAU C CACCUCAC  138  GUGAGGUG CUGAUGAG GCCGUUAGGC CGAA AUCCUUUC  2327 
753  AUCCACCU C ACUCCUGA  139  UCAGGAGU CUGAUGAG GCCGUUAGGC CGAA AGGUGGAU  2328 
757  ACCUCACU C CUGAUUUC  140  GAAAUCAG CUGAUGAG GCCGUUAGGC CGAA AGUGAGGU  2329 
763  CUCCUGAU U UCAUUGCA  141  UGCAAUGA CUGAUGAG GCCGUUAGGC CGAA AUCAGGAG  2330 
764  UCCUGAUU U CAUUGCAG  142  CUGCAAUG CUGAUGAG GCCGUUAGGC CGAA AAUCAGGA  2331 
765  CCUGAUUU C AUUGCAGG  143  CCUGCAAU CUGAUGAG GCCGUUAGGC CGAA AAAUCAGG  2332 
768  GAUUUCAU U GCAGGAAA  144  UUUCCUGC CUGAUGAG GCCGUUAGGC CGAA AUGAAAUC  2333 
782  AAAAAAGU U AGCUGAAU  145  AUUCAGCU CUGAUGAG GCCGUUAGGC CGAA ACUUUUUU  2334 
783  AAAAAGUU A GCUGAAUA  146  UAUUCAGC CUGAUGAG GCCGUUAGGC CGAA AACUUUUU  2335 
791  AGCUGAAU A UGGACCAC  147  GUGGUCCA CUGAUGAG GCCGUUAGGC CGAA AUUCAGCU  2336 
805  CACAAGGU A AGGCAUUU  148  AAAUGCCU CUGAUGAG GCCGUUAGGC CGAA ACCUUGUG  2337 
812  UAAGGCAU U UGUCCAUG  149  CAUGGACA CUGAUGAG GCCGUUAGGC CGAA AUGCCUUA  2338 
813  AAGGCAUU U GUCCAUGA  150  UCAUGGAC CUGAUGAG GCCGUUAGGC CGAA AAUGCCUU  2339 
816  GCAUUUGU C CAUGAGUG  151  CACUCAUG CUGAUGAG GCCGUUAGGC CGAA ACAAAUGC  2340 
829  AGUGGGCU C AUCUACGA  152  UCGUAGAU CUGAUGAG GCCGUUAGGC CGAA AGCCCACU  2341 
832  GGGCUCAU C UACGAUGG  153  CCAUCGUA CUGAUGAG GCCGUUAGGC CGAA AUGAGCCC  2342 
834  GCUCAUCU A CGAUGGGG  154  CCCCAUCG CUGAUGAG GCCGUUAGGC CGAA AGAUGAGC  2343 
846  UGGGGAGU A UUUGACGA  155  UCGUCAAA CUGAUGAG GCCGUUAGGC CGAA ACUCCCCA  2344 
848  GGGAGUAU U UGACGAGU  156  ACUCGUCA CUGAUGAG GCCGUUAGGC CGAA AUACUCCC  2345 
849  GGAGUAUU U GACGAGUA  157  UACUCGUC CUGAUGAG GCCGUUAGGC CGAA AAUACUCC  2346 
857  UGACGAGU A CAAUAAUG  158  CAUUAUUG CUGAUGAG GCCGUUAGGC CGAA ACUCGUCA  2347 
A AUGAUGAG  159  CUCAUCAU CUGAUGAG GCCGUUAGGC CGAA AUUGUACU  2348 
875  UGAGAAAU U CUACUUAU  160  AUAAGUAG CUGAUGAG GCCGUUAGGC CGAA AUUUCUCA  2349 
876  GAGAAAUU C UACUUAUC  161  GAUAAGUA CUGAUGAG GCCGUUAGGC CGAA AAUUUCUC  2350 
878  GAAAUUCU A CUUAUCCA  162  UGGAUAAG CUGAUGAG GCCGUUAGGC CGAA AGAAUUUC  2351 
881  AUUCUACU U AUCCAAUG  163  CAUUGGAU CUGAUGAG GCCGUUAGGC CGAA AGUAGAAU  2352 
882  UUCUACUU A UCCAAUGG  164  CCAUUGGA CUGAUGAG GCCGUUAGGC CGAA AAGUAGAA  2353 
884  CUACUUAU C CAAUGGAA  165  UUCCAUUG CUGAUGAG GCCGUUAGGC CGAA AUAAGUAG  2354 
897  GGAAGAAU A CAAGCAGU  166  ACUGCUUG CUGAUGAG GCCGUUAGGC CGAA AUUCUUCC  2355 
906  CAAGCAGU A AGAUGUUC  167  GAACAUCU CUGAUGAG GCCGUUAGGC CGAA ACUGCUUG  2356 
913  UAAGAUGU U CAGCAGGU  168  ACCUGCUG CUGAUGAG GCCGUUAGGC CGAA ACAUCUUA  2357 
914  AAGAUGUU C AGCAGGUA  169  UACCUGCU CUGAUGAG GCCGUUAGGC CGAA AACAUCUU  2358 
922  CAGCAGGU A UUACUGGU  170  ACCAGUAA CUGAUGAG GCCGUUAGGC CGAA ACCUGCUG  2359 
924  GCAGGUAU U ACUGGUAC  171  GUACCAGU CUGAUGAG GCCGUUAGGC CGAA AUACCUGC  2360 
925  CAGGUAUU A CUGGUACA  172  UGUACCAG CUGAUGAG GCCGUUAGGC CGAA AAUACCUG  2361 
931  UUACUGGU A CAAAUGUA  173  UACAUUUG CUGAUGAG GCCGUUAGGC CGAA ACCAGUAA  2362 
939  ACAAAUGU A GUAAAGAA  174  UUCUUUAC CUGAUGAG GCCGUUAGGC CGAA ACAUUUGU  2363 
942  AAUGUAGU A AAGAAGUG  175  CACUUCUU CUGAUGAG GCCGUUAGGC CGAA ACUACAUU  2364 
952  AGAAGUGU C AGGGAGGC  176  GCCUCCCU CUGAUGAG GCCGUUAGGC CGAA ACACUUCU  2365 
967  GCAGCUGU U ACACCAAA  177  UUUGGUGU CUGAUGAG GCCGUUAGGC CGAA ACAGCUGC  2366 
968  CAGCUGUU A CACCAAAA  178  UUUUGGUG CUGAUGAG GCCGUUAGGC CGAA AACAGCUG  2367 
986  AUGCACAU U CAAUAAAG  179  CUUUAUUG CUGAUGAG GCCGUUAGGC CGAA AUGUGCAU  2368 
987  UGCACAUU C AAUAAAGU  180  ACUUUAUU CUGAUGAG GCCGUUAGGC CGAA AAUGUGCA  2369 
991  CAUUCAAU A AAGUUACA  181  UGUAACUU CUGAUGAG GCCGUUAGGC CGAA AUUGAAUG  2370 
996  AAUAAAGU U ACAGGACU  182  AGUCCUGU CUGAUGAG GCCGUUAGGC CGAA ACUUUAUU  2371 
997  AUAAAGUU A CAGGACUC  183  GAGUCCUG CUGAUGAG GCCGUUAGGC CGAA AACUUUAU  2372 
1005  ACAGGACU C UAUGAAAA  184  UUUUCAUA CUGAUGAG GCCGUUAGGC CGAA AGUCCUGU  2373 
1007  AGGACUCU A UGAAAAAG  185  CUUUUUCA CUGAUGAG GCCGUUAGGC CGAA AGAGUCCU  2374 
1025  AUGUGAGU U UGUUCUCC  186  GGAGAACA CUGAUGAG GCCGUUAGGC CGAA ACUCACAU  2375 
1026  UGUGAGUU U GUUCUCCA  187  UGGAGAAC CUGAUGAG GCCGUUAGGC CGAA AACUCACA  2376 
1029  GAGUUUGU U CUCCAAUC  188  GAUUGGAG CUGAUGAG GCCGUUAGGC CGAA ACAAACUC  2377 
1030  AGUUUGUU C UCCAAUCC  189  GGAUUGGA CUGAUGAG GCCGUUAGGC CGAA AACAAACU  2378 
1032  UUUGUUCU C CAAUCCCG  190  CGGGAUUG CUGAUGAG GCCGUUAGGC CGAA AGAACAAA  2379 
1037  UCUCCAAU C CCGCCAGA  191  UCUGGCGG CUGAUGAG GCCGUUAGGC CGAA AUUGGAGA  2380 
1057  AGAAGGCU U CUAUAAUG  192  CAUUAUAG CUGAUGAG GCCGUUAGGC CGAA AGCCUUCU  2381 
1058  GAAGGCUU C UAUAAUGU  193  ACAUUAUA CUGAUGAG GCCGUUAGGC CGAA AAGCCUUC  2382 
1060  AGGCUUCU A UAAUGUUU  194  AAACAUUA CUGAUGAG GCCGUUAGGC CGAA AGAAGCCU  2383 
1062  GCUUCUAU A AUGUUUGC  195  GCAAACAU CUGAUGAG GCCGUUAGGC CGAA AUAGAAGC  2384 
1067  UAUAAUGU U UGCACAAC  196  GUUGUGCA CUGAUGAG GCCGUUAGGC CGAA ACAUUAUA  2385 
1068  AUAAUGUU U GCACAACA  197  UGUUGUGC CUGAUGAG GCCGUUAGGC CGAA AACAUUAU  2386 
1080  CAACAUGU U GAUUCUAU  198  AUAGAAUC CUGAUGAG GCCGUUAGGC CGAA ACAUGUUG  2387 
1084  AUGUUGAU U CUAUAGUU  199  AACUAUAG CUGAUGAG GCCGUUAGGC CGAA AUCAACAU  2388 
1085  UGUUGAUU C UAUAGUUG  200  CAACUAUA CUGAUGAG GCCGUUAGGC CGAA AAUCAACA  2389 
1087  UUGAUUCU A UAGUUGAA  201  UUCAACUA CUGAUGAG GCCGUUAGGC CGAA AGAAUCAA  2390 
1089  GAUUCUAU A GUUGAAUU  202  AAUUCAAC CUGAUGAG GCCGUUAGGC CGAA AUAGAAUC  2391 
1092  UCUAUAGU U GAAUUCUG  203  CAGAAUUC CUGAUGAG GCCGUUAGGC CGAA ACUAUAGA  2392 
1097  AGUUGAAU U CUGUACAG  204  CUGUACAG CUGAUGAG GCCGUUAGGC CGAA AUUCAACU  2393 
1098  GUUGAAUU C UGUACAGA  205  UCUGUACA CUGAUGAG GCCGUUAGGC CGAA AAUUCAAC  2394 
1102  AAUUCUGU A CAGAACAA  206  UUGUUCUG CUGAUGAG GCCGUUAGGC CGAA ACAGAAUU  2395 
1129  AAGAAGCU C CAAACAAG  207  CUUGUUUG CUGAUGAG GCCGUUAGGC CGAA AGCUUCUU  2396 
1144  AGCAAAAU C AAAAAUGC  208  GCAUUUUU CUGAUGAG GCCGUUAGGC CGAA AUUUUGCU  2397 
1156  AAUGCAAU C UCCGAAGC  209  GCUUCGGA CUGAUGAG GCCGUUAGGC CGAA AUUGCAUU  2398 
1158  UGCAAUCU C CGAAGCAC  210  GUGCUUCG CUGAUGAG GCCGUUAGGC CGAA AGAUUGCA  2399 
1179  GAAGUGAU C CGUGAUUC  211  GAAUCACG CUGAUGAG GCCGUUAGGC CGAA AUCACUUC  2400 
1186  UCCGUGAU U CUGAGGAC  212  GUCCUCAG CUGAUGAG GCCGUUAGGC CGAA AUCACGGA  2401 
1187  CCGUGAUU C UGAGGACU  213  AGUCCUCA CUGAUGAG GCCGUUAGGC CGAA AAUCACGG  2402 
1196  UGAGGACU U UAAGAAAA  214  UUUUCUUA CUGAUGAG GCCGUUAGGC CGAA AGUCCUCA  2403 
1197  GAGGACUU U AAGAAAAC  215  GUUUUCUU CUGAUGAG GCCGUUAGGC CGAA AAGUCCUC  2404 
1198  AGGACUUU A AGAAAACC  216  GGUUUUCU CUGAUGAG GCCGUUAGGC CGAA AAAGUCCU  2405 
1210  AAACCACU C CUAUGACA  217  UGUCAUAG CUGAUGAG GCCGUUAGGC CGAA AGUGGUUU  2406 
1213  CCACUCCU A UGACAACA  218  UGUUGUCA CUGAUGAG GCCGUUAGGC CGAA AGGAGUGG  2407 
1234  CACCAAAU C CCACCUUC  219  GAAGGUGG CUGAUGAG GCCGUUAGGC CGAA AUUUGGUG  2408 
1241  UCCCACCU U CUCAUUGC  220  GCAAUGAG CUGAUGAG GCCGUUAGGC CGAA AGGUGGGA  2409 
1242  CCCACCUU C UCAUUGCU  221  AGCAAUGA CUGAUGAG GCCGUUAGGC CGAA AAGGUGGG  2410 
1244  CACCUUCU C AUUGCUGC  222  GCAGCAAU CUGAUGAG GCCGUUAGGC CGAA AGAAGGUG  2411 
1247  CUUCUCAU U GCUGCAGA  223  UCUGCAGC CUGAUGAG GCCGUUAGGC CGAA AUGAGAAG  2412 
1257  CUGCAGAU U GGACAAAG  224  CUUUGUCC CUGAUGAG GCCGUUAGGC CGAA AUCUGCAG  2413 
1269  CAAAGAAU U GUGUGUUU  225  AAACACAC CUGAUGAG GCCGUUAGGC CGAA AUUCUUUG  2414 
1276  UUGUGUGU U UAGUCCUU  226  AAGGACUA CUGAUGAG GCCGUUAGGC CGAA ACACACAA  2415 
1277  UGUGUGUU U AGUCCUUG  227  CAAGGACU CUGAUGAG GCCGUUAGGC CGAA AACACACA  2416 
1278  GUGUGUUU A GUCCUUGA  228  UCAAGGAC CUGAUGAG GCCGUUAGGC CGAA AAACACAC  2417 
1281  UGUUUAGU C CUUGACAA  229  UUGUCAAG CUGAUGAG GCCGUUAGGC CGAA ACUAAACA  2418 
1284  UUAGUCCU U GACAAAUC  230  GAUUUGUC CUGAUGAG GCCGUUAGGC CGAA AGGACUAA  2419 
1292  UGACAAAU C UGGAAGCA  231  UGCUUCCA CUGAUGAG GCCGUUAGGC CGAA AUUUGUCA  2420 
1312  CGACUGGU A ACCGCCUC  232  GAGGCGGU CUGAUGAG GCCGUUAGGC CGAA ACCAGUCG  2421 
1320  AACCGCCU C AAUCGACU  233  AGUCGAUU CUGAUGAG GCCGUUAGGC CGAA AGGCGGUU  2422 
1324  GCCUCAAU C GACUGAAU  234  AUUCAGUC CUGAUGAG GCCGUUAGGC CGAA AUUGAGGC  2423 
1333  GACUGAAU C AAGCAGGC  235  GCCUGCUU CUGAUGAG GCCGUUAGGC CGAA AUUCAGUC  2424 
1347  GGCCAGCU U UUCCUGCU  236  AGCAGGAA CUGAUGAG GCCGUUAGGC CGAA AGCUGGCC  2425 
1348  GCCAGCUU U UCCUGCUG  237  CAGCAGGA CUGAUGAG GCCGUUAGGC CGAA AAGCUGGC  2426 
1349  CCAGCUUU U CCUGCUGC  238  GCAGCAGG CUGAUGAG GCCGUUAGGC CGAA AAAGCUGG  2427 
1350  CAGCUUUU C CUGCUGCA  239  UGCAGCAG CUGAUGAG GCCGUUAGGC CGAA AAAAGCUG  2428 
1365  CAGACAGU U GAGCUGGG  240  CCCAGCUC CUGAUGAG GCCGUUAGGC CGAA ACUGUCUG  2429 
1376  GCUGGGGU C CUGGGUUG  241  CAACCCAG CUGAUGAG GCCGUUAGGC CGAA ACCCCAGC  2430 
1383  UCCUGGGU U GGGAUGGU  242  ACCAUCCC CUGAUGAG GCCGUUAGGC CGAA ACCCAGGA  2431 
1397  GGUGACAU U UGACAGUG  243  CACUGUCA CUGAUGAG GCCGUUAGGC CGAA AUGUCACC  2432 
1398  GUGACAUU U GACAGUGC  244  GCACUGUC CUGAUGAG GCCGUUAGGC CGAA AAUGUCAC  2433 
1416  GCCCAUGU A CAAAGUGA  245  UCACUUUG CUGAUGAG GCCGUUAGGC CGAA ACAUGGGC  2434 
1428  AGUGAACU C AUACAGAU  246  AUCUGUAU CUGAUGAG GCCGUUAGGC CGAA AGUUCACU  2435 
1431  GAACUCAU A CAGAUAAA  247  UUUAUCUG CUGAUGAG GCCGUUAGGC CGAA AUGAGUUC  2436 
1437  AUACAGAU A AACAGUGG  248  CCACUGUU CUGAUGAG GCCGUUAGGC CGAA AUCUGUAU  2437 
1464  GACACACU C GCCAAAAG  249  CUUUUGGC CUGAUGAG GCCGUUAGGC CGAA AGUGUGUC  2438 
1475  CAAAAGAU U ACCUGCAG  250  CUGCAGGU CUGAUGAG GCCGUUAGGC CGAA AUCUUUUG  2439 
1476  AAAAGAUU A CCUGCAGC  251  GCUGCAGG CUGAUGAG GCCGUUAGGC CGAA AAUCUUUU  2440 
1489  CAGCAGCU U CAGGAGGG  252  CCCUCCUG CUGAUGAG GCCGUUAGGC CGAA AGCUGCUG  2441 
1490  AGCAGCUU C AGGAGGGA  253  UCCCUCCU CUGAUGAG GCCGUUAGGC CGAA AAGCUGCU  2442 
1502  AGGGACGU C CAUCUGCA  254  UGCAGAUG CUGAUGAG GCCGUUAGGC CGAA ACGUCCCU  2443 
1506  ACGUCCAU C UGCAGCGG  255  CCGCUGCA CUGAUGAG GCCGUUAGGC CGAA AUGGACGU  2444 
1518  AGCGGGCU U CGAUCGGC  256  GCCGAUCG CUGAUGAG GCCGUUAGGC CGAA AGCCCGCU  2445 
1519  GCGGGCUU C GAUCGGCA  257  UGCCGAUC CUGAUGAG GCCGUUAGGC CGAA AAGCCCGC  2446 
1523  GCUUCGAU C GGCAUUUA  258  UAAAUGCC CUGAUGAG GCCGUUAGGC CGAA AUCGAAGC  2447 
1529  AUCGGCAU U UACUGUGA  259  UCACAGUA CUGAUGAG GCCGUUAGGC CGAA AUGCCGAU  2448 
1530  UCGGCAUU U ACUGUGAU  260  AUCACAGU CUGAUGAG GCCGUUAGGC CGAA AAUGCCGA  2449 
1531  CGGCAUUU A CUGUGAUU  261  AAUCACAG CUGAUGAG GCCGUUAGGC CGAA AAAUGCCG  2450 
1539  ACUGUGAU U AGGAAGAA  262  UUCUUCCU CUGAUGAG GCCGUUAGGC CGAA AUCACAGU  2451 
1540  CUGUGAUU A GGAAGAAA  263  UUUCUUCC CUGAUGAG GCCGUUAGGC CGAA AAUCACAG  2452 
1550  GAAGAAAU A UCCAACUG  264  CAGUUGGA CUGAUGAG GCCGUUAGGC CGAA AUUUCUUC  2453 
1552  AGAAAUAU C CAACUGAU  265  AUCAGUUG CUGAUGAG GCCGUUAGGC CGAA AUAUUUCU  2454 
1565  UGAUGGAU C UGAAAUUG  266  CAAUUUCA CUGAUGAG GCCGUUAGGC CGAA AUCCAUCA  2455 
1572  UCUGAAAU U GUGCUGCU  267  AGCAGCAC CUGAUGAG GCCGUUAGGC CGAA AUUUCAGA  2456 
1603  ACAACACU A UAAGUGGG  268  CCCACUUA CUGAUGAG GCCGUUAGGC CGAA AGUGUUGU  2457 
1605  AACACUAU A AGUGGGUG  269  CACCCACU CUGAUGAG GCCGUUAGGC CGAA AUAGUGUU  2458 
1616  UGGGUGCU U UAACGAGG  270  CCUCGUUA CUGAUGAG GCCGUUAGGC CGAA AGCACCCA  2459 


1617  U AACGAGGU  271  ACCUCGUU CUGAUGAG GCCGUUAGGC CGAA AAGCACCC  2460 


1618  GGUGCUUU A ACGAGGUC  272  GACCUCGU CUGAUGAG GCCGUUAGGC CGAA AAAGCACC  2461 
1626  AACGAGGU C AAACAAAG  273  CUUUGUUU CUGAUGAG GCCGUUAGGC CGAA ACCUCGUU  2462 
1644  GGUGCCAU C AUCCACAC  274  GUGUGGAU CUGAUGAG GCCGUUAGGC CGAA AUGGCACC  2463 
1647  GCCAUCAU C CACACAGU  275  ACUGUGUG CUGAUGAG GCCGUUAGGC CGAA AUGAUGGC  2464 
1656  CACACAGU C GCUUUGGG  276  CCCAAAGC CUGAUGAG GCCGUUAGGC CGAA ACUGUGUG  2465 






1660 


CAGUCGCU 


U 


UGGGGCCC 


111 


GGGCCCCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCGACUG 


2466 


1661 


AGUCGCUU 


U 


GGGGCCCU 


278 


AGGGCCCC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGCGACU 


2467 


1670 


GGGGCCCU 


C 


UGCAGCUC 


IIS 


GAGCUGCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGGCCCC 


2468 


1678 


CUGCAGCU 


C 


AAGAACUA 


280 


UAGUUCUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCUGCAG 


2469 


1686 


CAAGAACU 


A 


GAGGAGCU 


281 


AGCUCCUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUUCUUG 


2470 


1697 


GGAGCUGU 


C 


CAAAAUGA 


282 


UCAUUUUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAGCUCC 


2471 


1714 


CAGGAGGU 


U 


UACAGACA 


283 


UGUCUGUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACCUCCUG 


2472 


1715 


AGGAGGUU 


U 


ACAGACAU 


284 


AUGUCUGU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AACCUCCU 


2473 


1716 


GGAGGUUU 


A 


CAGACAUA 


285 


UAUGUCUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAACCUCC 


2474 


1724 


ACAGACAU 


A 


UGCUUCAG 


286 


CUGAAGCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGUCUGU 


2475 


1729 


CAUAUGCU 


U 


CAGAUCAA 


287 


UUGAUCUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCAUAUG 


2476 


1730 


AUAUGCUU 


C 


AGAUCAAG 


288 


CUUGAUCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGCAUAU 


2477 


1735 


CUUCAGAU 


C 


AAGUUCAG 


289 


CUGAACUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCUGAAG 


2478 


1740 


GAUCAAGU 


U 


CAGAACAA 


290 


UUGUUCUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUUGAUC 


2479 


1741 


AUCAAGUU 


C 


AGAACAAU 


291 


AUUGUUCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AACUUGAU 


2480 


1755 


AAUGGCCU 


C 


AUUGAUGC 


2 92 


GCAUCAAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGCCAUU 


2481 


1758 


GGCCUCAU 


U 


GAUGCUUU 


293 


AAAGCAUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGAGGCC 


2482 


1765 


UUGAUGCU 


U 


UUGGGGCC 


294 


GGCCCCAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCAUCAA 


2483 


1766 


UGAUGCUU 


U 


UGGGGCCC 


295 


GGGCCCCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGCAUCA 


2484 


1767 


GAUGCUUU 


U 


GGGGCCCU 


296 


AGGGCCCC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAAGCAUC 


2485 


1776 


GGGGCCCU 


u 


UCAUCAGG 


297 


CCUGAUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGGCCCC 


2486 


1777 


GGGCCCUU 


u 


CAUCAGGA 


298 


UCCUGAUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGGGCCC 


2487 


1778 


GGCCCUUU 


c 


AUCAGGAA 


299 


UUCCUGAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAAGGGCC 


2488 


1781 


CCUUUCAU 


c 


AGGAAAUG 


300 


CAUUUCCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGAAAGG 


2489 


1797 


GGAGCUGU 


c 


UCUCAGCG 


301 


CGCUGAGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAGCUCC 


2490 


1799 


AGCUGUCU 


c 


UCAGCGCU 


302 


AGCGCUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGACAGCU 


2491 


1801 


CUGUCUCU 


c 


AGCGCUCC 


303 


GGAGCGCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAGACAG 


2492 


1808 


UCAGCGCU 


c 


CAUCCAGC 


304 


GCUGGAUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCGCUGA 


2493 


1812 


CGCUCCAU 


C


CAGCUUGA 


305 


UCAAGCUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGGAGCG 


2494 


1818 


AUCCAGCU 


U


GAGAGUAA 


306 


UUACUCUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCUGGAU 


2495 


1825 


UUGAGAGU 


A 


AGGGAUUA 


307 


UAAUCCCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUCUCAA 


2496 


1832 


UAAGGGAU 


U


AACCCUCC 


308 


GGAGGGUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCCCUUA 


2497 


1833 


AAGGGAUU 


A 


ACCCUCCA 


309 


UGGAGGGU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUCCCUU 


2498 


1839 


UUAACCCU 


C 


CAGAACAG 


310 


CUGUUCUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGGUUAA 


2499 


1872 


ACAGUGAU 


C 


GUGGACAG 


311 


CUGUCCAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCACUGU 


2500 


1900 


AGGACACU 


U 


UGUUUCUU 


312 


AAGAAACA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUGUCCU 


2501 


1901 


GGACACUU 


U 


GUUUCUUA 


313 


UAAGAAAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGUGUCC 


2502 


1904 


CACUUUGU 


U 


UCUUAUCA 


314 


UGAUAAGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAAAGUG 


2503


1905 


ACUUUGUU 


U


CUUAUCAC 


315 


GUGAUAAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AACAAAGU 


2504 


1906 


CUUUGUUU 


C


UUAUCACC 


316 


GGUGAUAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAACAAAG 


2505 


1908 


UUGUUUCU 


U


AUCACCUG 


317 


CAGGUGAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAAACAA 


2506 


1909 


UGUUUCUU 


A 


UCACCUGG 


318 


CCAGGUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGAAACA 


2507 


1911 


UUUCUUAU 


C 


ACCUGGAC 


319 


GUCCAGGU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAAGAAA 


2508 


1930 


CGCAGCCU 


C 


CCCAAAUC 


320 


GAUUUGGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGCUGCG 


2509 


1938 


CCCCAAAU 


C 


CUUCUCUG 


321 


CAGAGAAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUGGGG 


2510 


1941 


CAAAUCCU 


U 


CUCUGGGA 


322 


UCCCAGAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGAUUUG 


2511 


1942 


AAAUCCUU 


C 


UCUGGGAU 


323 


AUCCCAGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGGAUUU 


2512 


1944 


AUCCUUCU 


C 


UGGGAUCC 


324 


GGAUCCCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAAGGAU 


2513 


1951 


UCUGGGAU 


C 


CCAGUGGA 


325 


UCCACUGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCCCAGA 


2514 


1976 


AGGUGGCU 


U


UGUAGUGG 


326 


CCACUACA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCCACCU 


2515 


1977 


GGUGGCUU 


U


GUAGUGGA 


327 


UCCACUAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGCCACC 


2516 


1980 


GGCUUUGU 


A 


GUGGACAA 


328 


UUGUCCAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAAAGCC 


2517 


2006 


AAUGGCCU 


A 


CCUCCAAA 


329 


UUUGGAGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGCCAUU 


2518 


2010 


GCCUACCU 


C 


CAAAUCCC 


330 


GGGAUUUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGUAGGC 


2519 


2016 


CUCCAAAU 


C 


CCAGGCAU 


331 


AUGCCUGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUGGAG 


2520 


2025 


CCAGGCAU 


U


GCUAAGGU 


332 


ACCUUAGC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGCCUGG 


2521 






2029 


GCAUUGCU A AGGUUGGC 


333 


GCCAACCU CUGAUGAG GCCGUUAGGC CGAA AGCAAUGC 


2522 


2034 


GCUAAGGU U GGCACUUG 


334 


CAAGUGCC CUGAUGAG GCCGUUAGGC CGAA ACCUUAGC 


2523 


2041 


UUGGCACU U GGAAAUAC 


335 


GUAUUUCC CUGAUGAG GCCGUUAGGC CGAA AGUGCCAA 


2524 


2048 


UUGGAAAU A CAGUCUGC 


336 


GCAGACUG CUGAUGAG GCCGUUAGGC CGAA AUUUCCAA 


2525 


2053 


AAUACAGU C UGCAAGCA 


337 


UGCUUGCA CUGAUGAG GCCGUUAGGC CGAA ACUGUAUU 


2526 


2066 


AGCAAGCU C ACAAACCU 


338 


AGGUUUGU CUGAUGAG GCCGUUAGGC CGAA AGCUUGCU 


2527 


2075 


ACAAACCU U GACCCUGA 


339 


UCAGGGUC CUGAUGAG GCCGUUAGGC CGAA AGGUUUGU 


2528 


2088 


CUGACUGU C ACGUCCCG 


340 


CGGGACGU CUGAUGAG GCCGUUAGGC CGAA ACAGUCAG 


2529 


2093 


UGUCACGU C CCGUGCGU 


341 


ACGCACGG CUGAUGAG GCCGUUAGGC CGAA ACGUGACA 


2530 


2102 


CCGUGCGU C CAAUGCUA 


342 


UAGCAUUG CUGAUGAG GCCGUUAGGC CGAA ACGCACGG 


2531 


2110 


CCAAUGCU A CCCUGCCU 


343 


AGGCAGGG CUGAUGAG GCCGUUAGGC CGAA AGCAUUGG 


2532 


2119 


CCCUGCCU C CAAUUACA 


344 


UGUAAUUG CUGAUGAG GCCGUUAGGC CGAA AGGCAGGG 


2533 


2124 


CCUCCAAU U ACAGUGAC 


345 


GUCACUGU CUGAUGAG GCCGUUAGGC CGAA AUUGGAGG 


2534 


2125 


CUCCAAUU A CAGUGACU 


346 


AGUCACUG CUGAUGAG GCCGUUAGGC CGAA AAUUGGAG 


2535 


2134 


CAGUGACU U CCAAAACG 


347 


CGUUUUGG CUGAUGAG GCCGUUAGGC CGAA AGUCACUG 


2536 


2135 


AGUGACUU C CAAAACGA 


348 


UCGUUUUG CUGAUGAG GCCGUUAGGC CGAA AAGUCACU 


2537 


2162 


CAGCAAAU U CCCCAGCC 


349 


GGCUGGGG CUGAUGAG GCCGUUAGGC CGAA AUUUGCUG 


2538 


2163 


AGCAAAUU C CCCAGCCC 


350 


GGGCUGGG CUGAUGAG GCCGUUAGGC CGAA AAUUUGCU 


2539 


2173 


CCAGCCCU C UGGUAGUU 


351 


AACUACCA CUGAUGAG GCCGUUAGGC CGAA AGGGCUGG 


2540 


2178 


CCUCUGGU A GUUUAUGC 


352 


GCAUAAAC CUGAUGAG GCCGUUAGGC CGAA ACCAGAGG 


2541 


2181 


CUGGUAGU U UAUGCAAA 


353 


UUUGCAUA CUGAUGAG GCCGUUAGGC CGAA ACUACCAG 


2542 


2182 


UGGUAGUU U AUGCAAAU 


354 


AUUUGCAU CUGAUGAG GCCGUUAGGC CGAA AACUACCA 


2543 


2183 


GGUAGUUU A UGCAAAUA 


355 


UAUUUGCA CUGAUGAG GCCGUUAGGC CGAA AAACUACC 


2544 


2191 


AUGCAAAU A UUCGCCAA 


356 


UUGGCGAA CUGAUGAG GCCGUUAGGC CGAA AUUUGCAU 


2545 


2193 


GCAAAUAU U CGCCAAGG 


357 


CCUUGGCG CUGAUGAG GCCGUUAGGC CGAA AUAUUUGC 


2546 


2194 


CAAAUAUU C GCCAAGGA 


358 


UCCUUGGC CUGAUGAG GCCGUUAGGC CGAA AAUAUUUG 


2547 


2207 


AGGAGCCU C CCCAAUUC 


359 


GAAUUGGG CUGAUGAG GCCGUUAGGC CGAA AGGCUCCU 


2548 


2214 


UCCCCAAU U CUCAGGGC 


360 


GCCCUGAG CUGAUGAG GCCGUUAGGC CGAA AUUGGGGA 


2549 


2215 


CCCCAAUU C UCAGGGCC 


361 


GGCCCUGA CUGAUGAG GCCGUUAGGC CGAA AAUUGGGG 


2550 


2217 


CCAAUUCU C AGGGCCAG 


362 


CUGGCCCU CUGAUGAG GCCGUUAGGC CGAA AGAAUUGG 


2551 


2229 


GCCAGUGU C ACAGCCCU 


363 


AGGGCUGU CUGAUGAG GCCGUUAGGC CGAA ACACUGGC 


2552 


2241 


GCCCUGAU U GAAUCAGU 


364 


ACUGAUUC CUGAUGAG GCCGUUAGGC CGAA AUCAGGGC 


2553 


2246 


GAUUGAAU C AGUGAAUG 


365 


CAUUCACU CUGAUGAG GCCGUUAGGC CGAA AUUCAAUC 


2554 


2265 


AAAACAGU U ACCUUGGA 


366 


UCCAAGGU CUGAUGAG GCCGUUAGGC CGAA ACUGUUUU 


2555 


2266 


AAACAGUU A CCUUGGAA 


367 


UUCCAAGG CUGAUGAG GCCGUUAGGC CGAA AACUGUUU 


2556 


2270 


AGUUACCU U GGAACUAC 


368 


GUAGUUCC CUGAUGAG GCCGUUAGGC CGAA AGGUAACU 


2557 


2277 


UUGGAACU A CUGGAUAA 


369 


UUAUCCAG CUGAUGAG GCCGUUAGGC CGAA AGUUCCAA 


2558 


2284 


UACUGGAU A AUGGAGCA 


370 


UGCUCCAU CUGAUGAG GCCGUUAGGC CGAA AUCCAGUA 


2559 


2305 


CUGAUGCU A CUAAGGAU 


371 


AUCCUUAG CUGAUGAG GCCGUUAGGC CGAA AGCAUCAG 


2560 


2308 


AUGCUACU A AGGAUGAC 


372 


GUCAUCCU CUGAUGAG GCCGUUAGGC CGAA AGUAGCAU 


2561 


2322 


GACGGUGU C UACUCAAG 


373 


CUUGAGUA CUGAUGAG GCCGUUAGGC CGAA ACACCGUC 


2562 


2324 


CGGUGUCU A CUCAAGGU 


374 


ACCUUGAG CUGAUGAG GCCGUUAGGC CGAA AGACACCG 


2563 


2327 


UGUCUACU C AAGGUAUU 


375 


AAUACCUU CUGAUGAG GCCGUUAGGC CGAA AGUAGACA 


2564 


2333 


CUCAAGGU A UUUCACAA 


376 


UUGUGAAA CUGAUGAG GCCGUUAGGC CGAA ACCUUGAG 


2565 


2335 


CAAGGUAU U UCACAACU 


377 


AGUUGUGA CUGAUGAG GCCGUUAGGC CGAA AUACCUUG 


2566 


2336 


AAGGUAUU U CACAACUU 


378 


AAGUUGUG CUGAUGAG GCCGUUAGGC CGAA AAUACCUU 


2567 


2337 


AGGUAUUU C ACAACUUA 


379 


UAAGUUGU CUGAUGAG GCCGUUAGGC CGAA AAAUACCU 


2568 


2344 


UCACAACU U AUGACACG 


380 


CGUGUCAU CUGAUGAG GCCGUUAGGC CGAA AGUUGUGA 


2569 


2345 


CACAACUU A UGACACGA 


381 


UCGUGUCA CUGAUGAG GCCGUUAGGC CGAA AAGUUGUG 


2570 


2359 


CGAAUGGU A GAUACAGU 


382 


ACUGUAUC CUGAUGAG GCCGUUAGGC CGAA ACCAUUCG 


2571 


2363


UGGUAGAU A CAGUGUAA 


383 


UUACACUG CUGAUGAG GCCGUUAGGC CGAA AUCUACCA 


2572 


2370 


UACAGUGU A AAAGUGCG 


384 


CGCACUUU CUGAUGAG GCCGUUAGGC CGAA ACACUGUA 


2573 


2383 


UGCGGGCU C UGGGAGGA 


385 


UCCUCCCA CUGAUGAG GCCGUUAGGC CGAA AGCCCGCA 


2574 


2394 


GGAGGAGU U AACGCAGC 


386 


GCUGCGUU CUGAUGAG GCCGUUAGGC CGAA ACUCCUCC 


2575 


2395 


GAGGAGUU A ACGCAGCC 


387 


GGCUGCGU CUGAUGAG GCCGUUAGGC CGAA AACUCCUC 


2576 


2418 


AGAGUGAU A CCCCAGCA 


388 


UGCUGGGG CUGAUGAG GCCGUUAGGC CGAA AUCACUCU 


2577 
2441


AGCACUGU 


A 


CAUACCUG 


389 


CAGGUAUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAGUGCU 


2578 


2445 


CUGUACAU 


A 


CCUGGCUG 


390 


CAGCCAGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGUACAG 


2579 


2457 


GGCUGGAU 


U 


GAGAAUGA 


391 


UCAUUCUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCCAGCC 


2580 


2472 


GAUGAAAU 


A 


CAAUGGAA 


392 


UUCCAUUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUCAUC 


2581 


2482 


AAUGGAAU 


C 


CACCAAGA 


393 


UCUUGGUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUCCAUU 


2582 


2499 


CCUGAAAU 


U 


AAUAAGGA 


394


UCCUUAUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUCAGG 


2583 


2500 


CUGAAAUU 


A 


AUAAGGAU 


395 


AUCCUUAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUUUCAG 


2584 


2503 


AAAUUAAU 


A 


AGGAUGAU 


396 


AUCAUCCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUAAUUU 


2585 


2514 


GAUGAUGU 


U 


CAACACAA 


397 


UUGUGUUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAUCAUC 


2586 


2515 


AUGAUGUU 


C 


AACACAAG 


398 


CUUGUGUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AACAUCAU 


2587 


2533 


AAGUGUGU 


U 


UCAGCAGA 


399 


UCUGCUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACACACUU 


2588 


2534 


AGUGUGUU 


U 


CAGCAGAA 


400 


UUCUGCUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AACACACU 


2589 


2535 


GUGUGUUU 


C 


AGCAGAAC 


401 


GUUCUGCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAACACAC 


2590 


2546 


CAGAACAU 


C 


CUCGGGAG 


402 


CUCCCGAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGUUCUG 


2591 


2549 


AACAUCCU 


C 


GGGAGGCU 


403 


AGCCUCCC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGAUGUU 


2592 


2558 


GGGAGGCU 


C 


AUUUGUGG 


404 


CCACAAAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCCUCCC 


2593 


2561 


AGGCUCAU 


U 


UGUGGCUU 


405 


AAGCCACA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGAGCCU 


2594 


2562 


GGCUCAUU 


U 


GUGGCUUC 


406 


GAAGCCAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUGAGCC 


2595 


2569 


UUGUGGCU 


U


CUGAUGUC 


407 


GACAUCAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCCACAA 


2596 


2570 


UGUGGCUU 


C


UGAUGUCC 


408 


GGACAUCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGCCACA 


2597 


2577 


UCUGAUGU 


C


CCAAAUGC 


409 


GCAUUUGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAUCAGA 


2598 


2587 


CAAAUGCU 


C


CCAUACCU 


410 


AGGUAUGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCAUUUG 


2599 


2592 


GCUCCCAU 


A 


CCUGAUCU 


411 


AGAUCAGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGGGAGC 


2600 


2599 


UACCUGAU 


C 


UCUUCCCA 


412 


UGGGAAGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCAGGUA 


2601 


2601 


CCUGAUCU 


C 


UUCCCACC 


413 


GGUGGGAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAUCAGG 


2602 


2603 


UGAUCUCU 


U


CCCACCUG 


414 


CAGGUGGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAGAUCA 


2603 


2604 


GAUCUCUU 


C 


CCACCUGG 


415 


CCAGGUGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGAGAUC 


2604 


2619 


GGCCAAAU 


C 


ACCGACCU 


416 


AGGUCGGU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUGGCC 


2605 


2640 


GCGGAAAU 


U


CACGGGGG 


417 


CCCCCGUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUCCGC 


2606 


2641 


CGGAAAUU 


C


ACGGGGGC 


418 


GCCCCCGU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUUUCCG 


2607 


2653 


GGGGCAGU 


C


UCAUUAAU 


419 


AUUAAUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUGCCCC 


2608 


2655 


GGCAGUCU 


C


AUUAAUCU 


420 


AGAUUAAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGACUGCC 


2609 


2658 


AGUCUCAU 


U


AAUCUGAC 


421 


GUCAGAUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGAGACU 


2610 


2659 


GUCUCAUU 


A 


AUCUGACU 


422 


AGUCAGAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUGAGAC 


2611 


2662 


UCAUUAAU 


C 


UGACUUGG 


423 


CCAAGUCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUAAUGA 


2612 


2668 


AUCUGACU 


U


GGACAGCU 


424 


AGCUGUCC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUCAGAU 


2613 


2677 


GGACAGCU 


C 


CUGGGGAU 


425 


AUCCCCAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCUGUCC 


2614 


2689 


GGGAUGAU 


U


AUGACCAU 


426 


AUGGUCAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCAUCCC 


2615 


2690 


GGAUGAUU 


A 


UGACCAUG 


427 


CAUGGUCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUCAUCC 


2616 


2707 


GAACAGCU 


C


ACAAGUAU 


428 


AUACUUGU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCUGUUC 


2617 


2714 


UCACAAGU 


A 


UAUCAUUC 


429 


GAAUGAUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUUGUGA 


2618 


2716 


ACAAGUAU 


A 


UCAUUCGA 


430 


UCGAAUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUACUUGU 


2619 


2718 


AAGUAUAU 


C 


AUUCGAAU 


431 


AUUCGAAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAUACUU 


2620 


2721 


UAUAUCAU 


U 


CGAAUAAG 


432 


CUUAUUCG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGAUAUA 


2621 


2722 


AUAUCAUU 


C 


GAAUAAGU 


433 


ACUUAUUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUGAUAU 


2622 


2727 


AUUCGAAU


A 


AGUACAAG 


434 


CUUGUACU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUCGAAU 


2623 


2731 


GAAUAAGU 


A 


CAAGUAUU 


435 


AAUACUUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUUAUUC 


2624 


2737 


GUACAAGU 


A 


UUCUUGAU 


436 


AUCAAGAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUUGUAC 


2625 


2739 


ACAAGUAU 


U 


CUUGAUCU 


437 


AGAUCAAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUACUUGU 


2626 


2740 


CAAGUAUU 


C 


UUGAUCUC 


438 


GAGAUCAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUACUUG 


2627 


2742 


AGUAUUCU 


U


GAUCUCAG 


439 


CUGAGAUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAAUACU 


2628 


2746 


UUCUUGAU 


C


UCAGAGAC 


440 


GUCUCUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCAAGAA 


2629 


2748 


CUUGAUCU 


C


AGAGACAA 


441 


UUGUCUCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAUCAAG 


2630 


2759 


AGACAAGU 


U


CAAUGAAU 


442 


AUUCAUUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUUGUCU 


2631 


2760 


GACAAGUU 


C


AAUGAAUC 


443 


GAUUCAUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AACUUGUC 


2632 


2768 


CAAUGAAU 


C


UCUUCAAG 


444 


CUUGAAGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUCAUUG 


2633 
2770


AUGAAUCU 


C 


UUCAAGUG 


445 


CACUUGAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAUUCAU 


2634 


2772 


GAAUCUCU 


U 


CAAGUGAA 


446 


UUCACUUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAGAUUC 


2635 


2773 


AAUCUCUU 


C 


AAGUGAAU 


447 


AUUCACUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGAGAUU 


2636 


2782 


AAGUGAAU 


A 


CUACUGCU 


448 


AGCAGUAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUCACUU 


2637 


2785 


UGAAUACU 


A 


CUGCUCUC 


449 


GAGAGCAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUAUUCA 


2638 


2791 


CUACUGCU 


C 


UCAUCCCA 


450 


UGGGAUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCAGUAG 


2639 


2793 


ACUGCUCU 


C 


AUCCCAAA 


451 


UUUGGGAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAGCAGU 


2640 


2796 


GCUCUCAU 


C 


CCAAAGGA 


452 


UCCUUUGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGAGAGC 


2641 


2813 


AGCCAACU 


C 


UGAGGAAG 


453 


CUUCCUCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUUGGCU 


2642 


2823 


GAGGAAGU 


C 


UUUUUGUU 


454 


AACAAAAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUUCCUC 


2643 


2825 


GGAAGUCU 


U 


UUUGUUUA 


455 


UAAACAAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGACUUCC 


2644 


2826 


GAAGUCUU 


U 


UUGUUUAA 


456 


UUAAACAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGACUUC 


2645 


2827 


AAGUCUUU 


U 


UGUUUAAA 


457 


UUUAAACA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAAGACUU 


2646 


2828 


AGUCUUUU 


U 


GUUUAAAC 


458 


GUUUAAAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAAAGACU 


2647 


2831 


CUUUUUGU 


U 


UAAACCAG 


459 


CUGGUUUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAAAAAG 


2648 


2832 


UUUUUGUU 


U 


AAACCAGA 


460 


UCUGGUUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AACAAAAA 


2649 


2833 


UUUUGUUU 


A 


AACCAGAA 


461 


UUCUGGUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAACAAAA 


2650 


2847 


GAAAACAU 


U 


ACUUUUGA 


462 


UCAAAAGU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGUUUUC 


2651 


2848 


AAAACAUU 


A 


CUUUUGAA 


463 


UUCAAAAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUGUUUU 


2652 


2851 


ACAUUACU 


U 


UUGAAAAU 


464 


AUUUUCAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUAAUGU 


2653 


2852 


CAUUACUU 


U


UGAAAAUG 


465 


CAUUUUCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGUAAUG 


2654 


2853 


AUUACUUU 


U


GAAAAUGG 


466 


CCAUUUUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAAGUAAU 


2655 


2869 


GCACAGAU 


C


UUUUCAUU 


467 


AAUGAAAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCUGUGC 


2656 


2871 


ACAGAUCU 


U


UUCAUUGC 


468 


GCAAUGAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAUCUGU 


2657 


2872 


CAGAUCUU 


U


UCAUUGCU 


469 


AGCAAUGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGAUCUG 


2658 


2873 


AGAUCUUU 


U


CAUUGCUA 


470 


UAGCAAUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAAGAUCU 


2659 


2874 


GAUCUUUU 


C


AUUGCUAU 


471 


AUAGCAAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAAAGAUC 


2660 


2877 


CUUUUCAU 


U


GCUAUUCA 


472 


UGAAUAGC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGAAAAG 


2661 


2881 


UCAUUGCU 


A 


UUCAGGCU 


473 


AGCCUGAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCAAUGA 


2662 


2883 


AUUGCUAU 


U 


CAGGCUGU 


474 


ACAGCCUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAGCAAU 


2663 


2884 


UUGCUAUU 


C


AGGCUGUU 


475 


AACAGCCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUAGCAA 


2664 


2892 


CAGGCUGU 


U


GAUAAGGU 


476 


ACCUUAUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAGCCUG 


2665 


2896 


CUGUUGAU 


A 


AGGUCGAU 


477 


AUCGACCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCAACAG 


2666 


2901 


GAUAAGGU 


C 


GAUCUGAA 


478 


UUCAGAUC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACCUUAUC 


2667 


2905 


AGGUCGAU 


C 


UGAAAUCA 


479 


UGAUUUCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCGACCU 


2668 


2912 


UCUGAAAU 


C 


AGAAAUAU 


480 


AUAUUUCU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUCAGA 


2669 


2919 


UCAGAAAU 


A 


UCCAACAU 


481 


AUGUUGGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUCUGA 


2670 


2921 


AGAAAUAU 


C 


CAACAUUG 


482 


CAAUGUUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAUUUCU 


2671 


2928 


UCCAACAU 


U 


GCACGAGU 


483 


ACUCGUGC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGUUGGA 


2672 


2937 


GCACGAGU 


A 


UCUUUGUU 


484 


AACAAAGA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUCGUGC 


2673 


2939 


ACGAGUAU 


C 


UUUGUUUA 


485 


UAAACAAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUACUCGU 


2674 


2941 


GAGUAUCU 


U


UGUUUAUU 


486 


AAUAAACA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGAUACUC 


2675 


2942 


AGUAUCUU 


U


GUUUAUUC 


487 


GAAUAAAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGAUACU 


2676 


2945 


AUCUUUGU 


U


UAUUCCUC 


488 


GAGGAAUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAAAGAU 


2677 


2946 


UCUUUGUU 


U


AUUCCUCC 


489 


GGAGGAAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AACAAAGA 


2678 


2947 


CUUUGUUU 


A 


UUCCUCCA 


490 


UGGAGGAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAACAAAG 


2679 


2949 


UUGUUUAU 


U 


CCUCCACA 


491 


UGUGGAGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAAACAA 


2680 


2950 


UGUUUAUU 


C 


CUCCACAG 


492 


CUGUGGAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUAAACA 


2681 


2953 


UUAUUCCU 


C 


CACAGACU 


493 


AGUCUGUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGAAUAA 


2682 


2962 


CACAGACU 


C 


CGCCAGAG 


494 


CUCUGGCG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUCUGUG 


2683 


2977


AGACACCU 


A 


GUCCUGAU 


495


AUCAGGAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGUGUCU 


2684 


2980 


CACCUAGU 


C 


CUGAUGAA 


496 


UUCAUCAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUAGGUG 


2685 


2993 


UGAAACGU 


C 


UGCUCCUU 


497 


AAGGAGCA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACGUUUCA 


2686 


2998 


CGUCUGCU 


C 


CUUGUCCU 


498 


AGGACAAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCAGACG 


2687 


3001 


CUGCUCCU 


U


GUCCUAAU 


499 


AUUAGGAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGGAGCAG 


2688 


3004 


CUCCUUGU 


C


CUAAUAUU 


500 


AAUAUUAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAAGGAG 


2689 
3007


CUUGUCCU A AUAUUCAU 


501 


AUGAAUAU CUGAUGAG GCCGUUAGGC CGAA AGGACAAG 


2690 


3010 


GUCCUAAU A UUCAUAUC 


502 


GAUAUGAA CUGAUGAG GCCGUUAGGC CGAA AUUAGGAC 


2691 


3012 


CCUAAUAU U CAUAUCAA 


503 


UUGAUAUG CUGAUGAG GCCGUUAGGC CGAA AUAUUAGG 


2692 


3013 


CUAAUAUU C AUAUCAAC 


504 


GUUGAUAU CUGAUGAG GCCGUUAGGC CGAA AAUAUUAG 


2693 


3016 


AUAUUCAU A UCAACAGC 


505 


GCUGUUGA CUGAUGAG GCCGUUAGGC CGAA AUGAAUAU 


2694 


3018 


AUUCAUAU C AACAGCAC 


506 


GUGCUGUU CUGAUGAG GCCGUUAGGC CGAA AUAUGAAU 


2695 


3030 


AGCACCAU U CCUGGCAU 


507 


AUGCCAGG CUGAUGAG GCCGUUAGGC CGAA AUGGUGCU 


2696 


3031 


GCACCAUU C CUGGCAUU 


508 


AAUGCCAG CUGAUGAG GCCGUUAGGC CGAA AAUGGUGC 


2697 


3039 


CCUGGCAU U CACAUUUU 


509 


AAAAUGUG CUGAUGAG GCCGUUAGGC CGAA AUGCCAGG 


2698 


3040 


CUGGCAUU C ACAUUUUA 


510 


UAAAAUGU CUGAUGAG GCCGUUAGGC CGAA AAUGCCAG 


2699 


3045 


AUUCACAU U UUAAAAAU 


511 


AUUUUUAA CUGAUGAG GCCGUUAGGC CGAA AUGUGAAU 


2700 


3046 


UUCACAUU U UAAAAAUU 


512 


AAUUUUUA CUGAUGAG GCCGUUAGGC CGAA AAUGUGAA 


2701 


3047 


UCACAUUU U AAAAAUUA 


513 


UAAUUUUU CUGAUGAG GCCGUUAGGC CGAA AAAUGUGA 


2702 


3048 


CACAUUUU A AAAAUUAU 


514 


AUAAUUUU CUGAUGAG GCCGUUAGGC CGAA AAAAUGUG 


2703 


3054 


UUAAAAAU U AUGUGGAA 


515 


UUCCACAU CUGAUGAG GCCGUUAGGC CGAA AUUUUUAA 


2704 


3055 


UAAAAAUU A UGUGGAAG 


516 


CUUCCACA CUGAUGAG GCCGUUAGGC CGAA AAUUUUUA 


2705 


3069 


AAGUGGAU A GGAGAACU 


517 


AGUUCUCC CUGAUGAG GCCGUUAGGC CGAA AUCCACUU 


2706 


3086 


GCAGCUGU C AAUAGCCU 


518 


AGGCUAUU CUGAUGAG GCCGUUAGGC CGAA ACAGCUGC 


2707 


3090 


CUGUCAAU A GCCUAGGG 


519 


CCCUAGGC CUGAUGAG GCCGUUAGGC CGAA AUUGACAG 


2708 


3095 


AAUAGCCU A GGGCUGAA 


520 


UUCAGCCC CUGAUGAG GCCGUUAGGC CGAA AGGCUAUU 


2709 


3105 


GGCUGAAU U UUUGUCAG 


521 


CUGACAAA CUGAUGAG GCCGUUAGGC CGAA AUUCAGCC 


2710 


3106 


GCUGAAUU U UUGUCAGA 


522 


UCUGACAA CUGAUGAG GCCGUUAGGC CGAA AAUUCAGC 


2711 


3107 


CUGAAUUU U UGUCAGAU 


523 


AUCUGACA CUGAUGAG GCCGUUAGGC CGAA AAAUUCAG 


2712 


3108 


UGAAUUUU U GUCAGAUA 


524 


UAUCUGAC CUGAUGAG GCCGUUAGGC CGAA AAAAUUCA 


2713 


3111 


AUUUUUGU C AGAUAAAU 


525 


AUUUAUCU CUGAUGAG GCCGUUAGGC CGAA ACAAAAAU 


2714 


3116 


UGUCAGAU A AAUAAAAU 


526 


AUUUUAUU CUGAUGAG GCCGUUAGGC CGAA AUCUGACA 


2715 


3120 


AGAUAAAU A AAAUAAAU 


527 


AUUUAUUU CUGAUGAG GCCGUUAGGC CGAA AUUUAUCU 


2716 


3125 


AAUAAAAU A AAUCAUUC 


528 


GAAUGAUU CUGAUGAG GCCGUUAGGC CGAA AUUUUAUU 


2717 


3129 


AAAUAAAU C AUUCAUCC 


529 


GGAUGAAU CUGAUGAG GCCGUUAGGC CGAA AUUUAUUU 


2718 


3132 


UAAAUCAU U CAUCCUUU 


530 


AAAGGAUG CUGAUGAG GCCGUUAGGC CGAA AUGAUUUA 


2719 


3133 


AAAUCAUU C AUCCUUUU 


531 


AAAAGGAU CUGAUGAG GCCGUUAGGC CGAA AAUGAUUU 


2720 


3136 


UCAUUCAU C CUUUUUUU 


532 


AAAAAAAG CUGAUGAG GCCGUUAGGC CGAA AUGAAUGA 


2721 


3139 


UUCAUCCU U UUUUUGAU 


533 


AUCAAAAA CUGAUGAG GCCGUUAGGC CGAA AGGAUGAA 


2722 


3140 


UCAUCCUU U UUUUGAUU 


534 


AAUCAAAA CUGAUGAG GCCGUUAGGC CGAA AAGGAUGA 


2723 


3141 


CAUCCUUU U UUUGAUUA 


535 


UAAUCAAA CUGAUGAG GCCGUUAGGC CGAA AAAGGAUG 


2724 


3142 


AUCCUUUU U UUGAUUAU 


536 


AUAAUCAA CUGAUGAG GCCGUUAGGC CGAA AAAAGGAU 


2725 


3143 


UCCUUUUU U UGAUUAUA 


537 


UAUAAUCA CUGAUGAG GCCGUUAGGC CGAA AAAAAGGA 


2726 


3144 


CCUUUUUU U GAUUAUAA 


538 


UUAUAAUC CUGAUGAG GCCGUUAGGC CGAA AAAAAAGG 


2727 


3148 


UUUUUGAU U AUAAAAUU 


539 


AAUUUUAU CUGAUGAG GCCGUUAGGC CGAA AUCAAAAA 


2728 


3149 


UUUUGAUU A UAAAAUUU 


540 


AAAUUUUA CUGAUGAG GCCGUUAGGC CGAA AAUCAAAA 


2729 


3151 


UUGAUUAU A AAAUUUUC 


541 


GAAAAUUU CUGAUGAG GCCGUUAGGC CGAA AUAAUCAA 


2730 


3156 


UAUAAAAU U UUCUAAAA 


542 


UUUUAGAA CUGAUGAG GCCGUUAGGC CGAA AUUUUAUA 


2731 


3157 


AUAAAAUU U UCUAAAAU 


543 


AUUUUAGA CUGAUGAG GCCGUUAGGC CGAA AAUUUUAU 


2732 


3158 


UAAAAUUU U CUAAAAUG 


544 


CAUUUUAG CUGAUGAG GCCGUUAGGC CGAA AAAUUUUA 


2733 


3159 


AAAAUUUU C UAAAAUGU 


545 


ACAUUUUA CUGAUGAG GCCGUUAGGC CGAA AAAAUUUU 


2734 


3161 


AAUUUUCU A AAAUGUAU 


546 


AUACAUUU CUGAUGAG GCCGUUAGGC CGAA AGAAAAUU 


2735 


3168 


UAAAAUGU A UUUUAGAC 


547 


GUCUAAAA CUGAUGAG GCCGUUAGGC CGAA ACAUUUUA 


2736 


3170 


AAAUGUAU U UUAGACUU 


548 


AAGUCUAA CUGAUGAG GCCGUUAGGC CGAA AUACAUUU 


2737 


3260 


AAAUGUAU U UUAGACUU 


548 


AAGUCUAA CUGAUGAG GCCGUUAGGC CGAA AUACAUUU 


2737 


3171 


AAUGUAUU U UAGACUUC 


549 


GAAGUCUA CUGAUGAG GCCGUUAGGC CGAA AAUACAUU 


2738 


3261 


AAUGUAUU U UAGACUUC 


549 


GAAGUCUA CUGAUGAG GCCGUUAGGC CGAA AAUACAUU 


2738 


3172 


AUGUAUUU U AGACUUCC 


550 


GGAAGUCU CUGAUGAG GCCGUUAGGC CGAA AAAUACAU 


2739 


3262 


AUGUAUUU U AGACUUCC 


550 


GGAAGUCU CUGAUGAG GCCGUUAGGC CGAA AAAUACAU 


2739 


3173 


UGUAUUUU A GACUUCCU 


551 


AGGAAGUC CUGAUGAG GCCGUUAGGC CGAA AAAAUACA 


2740 


3263 


UGUAUUUU A GACUUCCU 


551 


AGGAAGUC CUGAUGAG GCCGUUAGGC CGAA AAAAUACA 


2740 


3178 


UUUAGACU U CCUGUAGG 


552 


CCUACAGG CUGAUGAG GCCGUUAGGC CGAA AGUCUAAA 


2741 






3268 


UUUAGACU 


U 


CCUGUAGG 


552 


CCUACAGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUCUAAA 


2741 


3179 


UUAGACUU 


C 


CUGUAGGG 


553 


CCCUACAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGUCUAA 


2742 


3269 


UUAGACUU 


C 


CUGUAGGG 


553 


CCCUACAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAGUCUAA 


2742 


3184 


CUUCCUGU 


A 


GGGGGCGA 


554 


UCGCCCCC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAGGAAG 


2743 


3274 


CUUCCUGU 


A 


GGGGGCGA 


554 


UCGCCCCC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAGGAAG 


2743 


3194 


GGGGCGAU 


A 


UACUAAAU 


555 


AUUUAGUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCGCCCC 


2744 


3247 


GGGGCGAU 


A 


UACUAAAU 


555 


AUUUAGUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCGCCCC 


2744 


3196 


GGCGAUAU 


A 


CUAAAUGU 


556 


ACAUUUAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAUCGCC 


2745 


3249 


GGCGAUAU 


A 


CUAAAUGU 


556 


ACAUUUAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAUCGCC 


2745 


3199 


GAUAUACU 


A 


AAUGUAUA 


557 


UAUACAUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUAUAUC 


2746 


3205 


CUAAAUGU 


A 


UAUAGUAC 


558 


GUACUAUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAUUUAG 


2747 


3207 


AAAUGUAU 


A 


UAGUACAU 


559 


AUGUACUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUACAUUU 


2748 


3209 


AUGUAUAU 


A 


GUACAUUU 


560 


AAAUGUAC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAUACAU 


2749 


3212 


UAUAUAGU 


A 


CAUUUAUA 


561 


UAUAAAUG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACUAUAUA 


2750 


3216 


UAGUACAU 


U 


UAUACUAA 


562 


UUAGUAUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUGUACUA 


2751 


3217 


AGUACAUU 


U 


AUACUAAA 


563 


UUUAGUAU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUGUACU 


2752 


3218 


GUACAUUU 


A 


UACUAAAU 


564 


AUUUAGUA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAAUGUAC 


2753 


3220 


ACAUUUAU 


A 


CUAAAUGU 


565 


ACAUUUAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUAAAUGU 


2754 


3223 


UUUAUACU 


A 


AAUGUAUU 


566 


AAUACAUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUAUAAA 


2755 


3229 


CUAAAUGU 


A 


UUCCUGUA 


567 


UACAGGAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAUUUAG 


2756 


3231 


AAAUGUAU 


U 


CCUGUAGG 


568 


CCUACAGG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUACAUUU 


2757 


3232 


AAUGUAUU 


C 


CUGUAGGG 


569 


CCCUACAG 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AAUACAUU 


2758 


3237 


AUUCCUGU 


A 


GGGGGCGA 


570 


UCGCCCCC 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAGGAAU 


2759 


3252 


GAUAUACU 


A 


AAUGUAUU 


571 


AAUACAUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGUAUAUC 


2760 


3258 


CUAAAUGU 


A 


UUUUAGAC 


572 


GUCUAAAA 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


ACAUUUAG 


2761 


3284 


GGGGCGAU 


A 


AAAUAAAA 


573 


UUUUAUUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUCGCCCC 


2762 


3289 


GAUAAAAU 


A 


AAAUGCUA 


574 


UAGCAUUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AUUUUAUC 


2763 


3297 


AAAAUGCU 


A 


AACAACUG 


575 


CAGUUGUU 


CUGAUGAG 


GCCGUUAGGC 


CGAA 


AGCAUUUU 


2764 



Input Sequence = NM_001285. Cut Site = UH/.

Arm Length = 8. Core Sequence = CUGAUGAG GCCGUUAGGC CGAA 
Underlined region can be any X sequence or linker, as described herein. 
NM_001285 (Homo sapiens chloride channel, calcium activated, 1 (CLCA1)
mRNA, 3311 bp)
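
As an illustration of how the entries above relate to one another, the following minimal Python sketch rebuilds a hammerhead ribozyme sequence from a substrate entry. It assumes, consistent with the rows of the table, that each 8-nucleotide binding arm is the reverse complement of the substrate sequence on the opposite side of the cleaved nucleotide (H of the UH cut site), which is itself left unpaired. The function and variable names are hypothetical and are not part of the disclosure.

```python
# Illustrative sketch only; helper names are hypothetical.
# Core sequence as stated in the table footnote.
CORE = "CUGAUGAG GCCGUUAGGC CGAA"

# Watson-Crick complement map for RNA.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA string (5'->3')."""
    return rna.translate(_COMPLEMENT)[::-1]


def hammerhead_for(substrate: str, arm_length: int = 8) -> str:
    """Build a ribozyme entry from a 'LEFT H RIGHT' substrate entry.

    LEFT must end in U and H must be A, C, or U (cut site UH/).
    """
    left, cut, right = substrate.split()
    if len(left) != arm_length or len(right) != arm_length:
        raise ValueError("each flanking region must match the arm length")
    if not left.endswith("U") or cut not in ("A", "C", "U"):
        raise ValueError("substrate does not contain a UH cut site")
    # 5' arm pairs with the region 3' of the cut; 3' arm pairs with the
    # region 5' of the cut. The H nucleotide is not paired.
    return f"{reverse_complement(right)} {CORE} {reverse_complement(left)}"


print(hammerhead_for("GCAUUGCU A AGGUUGGC"))
```

Applied to the substrate at position 2029, the sketch reproduces the ribozyme sequence listed for that row. Arms of other lengths would require passing a different `arm_length`.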
Table VII: Human CLCA1 DNAzyme and Target Sequence



Pos 


Substrate 


Seq ID No


DNAzyme 


Rz Seq ID No


17 


CUUUUGGU A CAAAUGGA 


4 


TCCATTTG GGCTAGCTACAACGA ACCAAAAG 


3919 


34 


UGUGGAAU A UAAUUGAA 


5 


TTCAATTA GGCTAGCTACAACGA ATTCCACA 


3920 


44 


AAUUGAAU A UUUUCUUG 


8 


CAAGAAAA GGCTAGCTACAACGA ATTCAATT 


3921 


84 


UUGAGGUU A UGUCAAGC 


19 


GCTTGACA GGCTAGCTACAACGA AACCTCAA 


3922 


122 


AUGGAAAU A UUUACAAG 


22 


CTTGTAAA GGCTAGCTACAACGA ATTTCCAT 


3923 


126 


AAAUAUUU A CAAGUACG 


25 


CGTACTTG GGCTAGCTACAACGA AAATATTT 


3924 


132 


UUACAAGU A CGCAAUUU 


26 


AAATTGCG GGCTAGCTACAACGA ACTTGTAA 


3925 


152 


ACUAAGAU A UUGUUAUC 


30 


GATAACAA GGCTAGCTACAACGA ATCTTAGT 


3926 


158 


AUAUUGUU A UCAUUCUC 


33 


GAGAATGA GGCTAGCTACAACGA AACAATAT 


3927 


169 


AUUCUCCU A UUGAAGAC 


38 


GTCTTCAA GGCTAGCTACAACGA AGGAGAAT 


3928 


259 


GUGUGUCU A UAUUUUCA 


52 


TGAAAATA GGCTAGCTACAACGA AGACACAC 


3929 


261 


GUGUCUAU A UUUUCAUA 


53 


TATGAAAA GGCTAGCTACAACGA ATAGACAC 


3930 


269 


AUUUUCAU A UCUGUAUA 


58 


TATACAGA GGCTAGCTACAACGA ATGAAAAT 


3931 


275 


AUAUCUGU A UAUAUAUA 


60 


TATATATA GGCTAGCTACAACGA ACAGATAT 


3932 


277 


AUCUGUAU A UAUAUAAU 


61 


ATTATATA GGCTAGCTACAACGA ATACAGAT 


3933 


279 


CUGUAUAU A UAUAAUGG 


62 


CCATTATA GGCTAGCTACAACGA ATATACAG 


3934 


281 


GUAUAUAU A UAAUGGUA 


63 


TACCATTA GGCTAGCTACAACGA ATATATAC 


3935 


346 


GGAGAUGU A CAGCAAUG 


74 


CATTGCTG GGCTAGCTACAACGA ACATCTCC 


3936 


446 


CAAUGGCU A UGAAGGCA 


97 


TGCCTTCA GGCTAGCTACAACGA AGCCATTG 


3937 


539 


AUCUCUGU A UCUGUUUG 


108 


CAAACAGA GGCTAGCTACAACGA ACAGAGAT 


3938 


553 


UUGAAGCU A CAGGAAAG 


112 


CTTTCCTG GGCTAGCTACAACGA AGCTTCAA 


3939 


569 


GCGAUUUU A UUUCAAAA 


116 


TTTTGAAA GGCTAGCTACAACGA AAAATCGC 


3940 


623 


GGCUGACU A UGUGAGAC 


126 


GTCTCACA GGCTAGCTACAACGA AGTCAGCC 


3941 


647 


UGAGACCU A CAAAAAUG 


128 


CATTTTTG GGCTAGCTACAACGA AGGTCTCA 


3942 


679 


CUGAGUCU A CUCCUCCA 


133 


TGGAGGAG GGCTAGCTACAACGA AGACTCAG 


3943 


704 


UGAACCCU A CACUGAGC 


137 


GCTCAGTG GGCTAGCTACAACGA AGGGTTCA 


3944 


791 


AGCUGAAU A UGGACCAC 


147 


GTGGTCCA GGCTAGCTACAACGA ATTCAGCT 


3945 


834 


GCUCAUCU A CGAUGGGG 


154 


CCCCATCG GGCTAGCTACAACGA AGATGAGC 


3946 


846 


UGGGGAGU A UUUGACGA 


155 


TCGTCAAA GGCTAGCTACAACGA ACTCCCCA 


3947 


857 


UGACGAGU A CAAUAAUG 


158 


CATTATTG GGCTAGCTACAACGA ACTCGTCA 


3948 


878 


GAAAUUCU A CUUAUCCA 


162 


TGGATAAG GGCTAGCTACAACGA AGAATTTC 


3949 


882 


UUCUACUU A UCCAAUGG 


164 


CCATTGGA GGCTAGCTACAACGA AAGTAGAA 


3950 


897 


GGAAGAAU A CAAGCAGU 


166 


ACTGCTTG GGCTAGCTACAACGA ATTCTTCC 


3951 


922 


CAGCAGGU A UUACUGGU 


170 


ACCAGTAA GGCTAGCTACAACGA ACCTGCTG 


3952 


925 


CAGGUAUU A CUGGUACA 


172 


TGTACCAG GGCTAGCTACAACGA AATACCTG 


3953 


931 


UUACUGGU A CAAAUGUA 


173 


TACATTTG GGCTAGCTACAACGA ACCAGTAA 


3954 


968 


CAGCUGUU A CACCAAAA 


178 


TTTTGGTG GGCTAGCTACAACGA AACAGCTG 


3955 


997 


AUAAAGUU A CAGGACUC 


183 


GAGTCCTG GGCTAGCTACAACGA AACTTTAT 


3956 


1007 


AGGACUCU A UGAAAAAG 


185 


CTTTTTCA GGCTAGCTACAACGA AGAGTCCT 


3957 


1060 


AGGCUUCU A UAAUGUUU 


194 


AAACATTA GGCTAGCTACAACGA AGAAGCCT 


3958 


1087 


UUGAUUCU A UAGUUGAA 


201 


TTCAACTA GGCTAGCTACAACGA AGAATCAA 


3959 


1102 


AAUUCUGU A CAGAACAA 


206 


TTGTTCTG GGCTAGCTACAACGA ACAGAATT 


3960 


1213 


CCACUCCU A UGACAACA 


218 


TGTTGTCA GGCTAGCTACAACGA AGGAGTGG 


3961 


1416 


GCCCAUGU A CAAAGUGA 


245 


TCACTTTG GGCTAGCTACAACGA ACATGGGC 


3962 


1431 


GAACUCAU A CAGAUAAA 


247 


TTTATCTG GGCTAGCTACAACGA ATGAGTTC 


3963 


1476 


AAAAGAUU A CCUGCAGC 


251 


GCTGCAGG GGCTAGCTACAACGA AATCTTTT 


3964 


1531 


CGGCAUUU A CUGUGAUU 


261 


AATCACAG GGCTAGCTACAACGA AAATGCCG 


3965 


1550 


GAAGAAAU A UCCAACUG 


264 


CAGTTGGA GGCTAGCTACAACGA ATTTCTTC 


3966 


1603 


ACAACACU A UAAGUGGG 


268 


CCCACTTA GGCTAGCTACAACGA AGTGTTGT 


3967 


1716 


GGAGGUUU A CAGACAUA 


285 


TATGTCTG GGCTAGCTACAACGA AAACCTCC 


3968 


1724 


ACAGACAU A UGCUUCAG 


286 


CTGAAGCA GGCTAGCTACAACGA ATGTCTGT 


3969 



1909 


UGUUUCUU 


A 


UCACCUGG 


318 


CCAGGTGA 


GGCTAGCTACAACGA 


AAGAAACA 


3970 


2006 


AAUGGCCU 


A 


CCUCCAAA 


329 


TTTGGAGG 


GGCTAGCTACAACGA 


AGGCCATT 


3971 


2048 


UUGGAAAU 


A 


CAGUCUGC 


336 


GCAGACTG 


GGCTAGCTACAACGA 


ATTTCCAA 


3972 


2110 


CCAAUGCU 


A 


CCCUGCCU 


343 


AGGCAGGG 


GGCTAGCTACAACGA 


AGCATTGG 


3973 


2125 


CUCCAAUU 


A 


CAGUGACU 


346 


AGTCACTG 


GGCTAGCTACAACGA 


AATTGGAG 


3974 


2183 


GGUAGUUU 


A 


UGCAAAUA 


355 


TATTTGCA 


GGCTAGCTACAACGA 


AAACTACC 


3975 


2191 


AUGCAAAU 


A 


UUCGCCAA 


356 


TTGGCGAA 


GGCTAGCTACAACGA 


ATTTGCAT 


3976 


2266 


AAACAGUU 


A 


CCUUGGAA 


367 


TTCCAAGG 


GGCTAGCTACAACGA 


AACTGTTT 


3977 


2277 


UUGGAACU 


A 


CUGGAUAA 


369 


TTATCCAG 


GGCTAGCTACAACGA 


AGTTCCAA 


3978 


2305 


CUGAUGCU 


A 


CUAAGGAU 


371 


ATCCTTAG 


GGCTAGCTACAACGA 


AGCATCAG 


3979 


2324 


CGGUGUCU 


A 


CUCAAGGU 


374 


ACCTTGAG 


GGCTAGCTACAACGA 


AGACACCG 


3980 


2333 


CUCAAGGU 


A 


UUUCACAA 


376 


TTGTGAAA 


GGCTAGCTACAACGA 


ACCTTGAG 


3981 


2345 


CACAACUU 


A 


UGACACGA 


381 


TCGTGTCA 


GGCTAGCTACAACGA 


AAGTTGTG 


3982 


2363 


UGGUAGAU 


A 


CAGUGUAA 


383 


TTACACTG 


GGCTAGCTACAACGA 


ATCTACCA 


3983 


2418 


AGAGUGAU 


A 


CCCCAGCA 


388 


TGCTGGGG 


GGCTAGCTACAACGA 


ATCACTCT 


3984 


2441 


AGCACUGU 


A 


CAUACCUG 


389 


CAGGTATG 


GGCTAGCTACAACGA 


ACAGTGCT 


3985 


2445 


CUGUACAU 


A 


CCUGGCUG 


390 


CAGCCAGG 


GGCTAGCTACAACGA 


ATGTACAG 


3986 


2472 


GAUGAAAU 


A 


CAAUGGAA 


392


TTCCATTG 


GGCTAGCTACAACGA 


ATTTCATC 


3987 


2592 


GCUCCCAU 


A 


CCUGAUCU 


411 


AGATCAGG 


GGCTAGCTACAACGA 


ATGGGAGC 


3988 


2690 


GGAUGAUU 


A 


UGACCAUG 


427 


CATGGTCA 


GGCTAGCTACAACGA 


AATCATCC 


3989 


2714 


UCACAAGU 


A 


UAUCAUUC 


429 


GAATGATA 


GGCTAGCTACAACGA 


ACTTGTGA 


3990 


2716 


ACAAGUAU 


A 


UCAUUCGA 


430 


TCGAATGA 


GGCTAGCTACAACGA 


ATACTTGT 


3991 


2731 


GAAUAAGU 


A 


CAAGUAUU 


435 


AATACTTG 


GGCTAGCTACAACGA 


ACTTATTC 


3992 


2737 


GUACAAGU 


A 


UUCUUGAU 


436 


ATCAAGAA 


GGCTAGCTACAACGA 


ACTTGTAC 


3993 


2782 


AAGUGAAU 


A 


CUACUGCU 


448 


AGCAGTAG 


GGCTAGCTACAACGA 


ATTCACTT 


3994 


2785 


UGAAUACU 


A 


CUGCUCUC 


449 


GAGAGCAG 


GGCTAGCTACAACGA 


AGTATTCA 


3995 


2848 


AAAACAUU 


A 


CUUUUGAA 


463 


TTCAAAAG 


GGCTAGCTACAACGA 


AATGTTTT 


3996 


2881 


UCAUUGCU 


A 


UUCAGGCU 


473 


AGCCTGAA 


GGCTAGCTACAACGA 


AGCAATGA 


3997 


2919 


UCAGAAAU 


A 


UCCAACAU 


481 


ATGTTGGA 


GGCTAGCTACAACGA 


ATTTCTGA 


3998 


2937 


GCACGAGU 


A 


UCUUUGUU 


484 


AACAAAGA 


GGCTAGCTACAACGA 


ACTCGTGC 


3999 


2947 


CUUUGUUU 


A 


UUCCUCCA 


490 


TGGAGGAA 


GGCTAGCTACAACGA 


AAACAAAG 


4000 


3010 


GUCCUAAU 


A 


UUCAUAUC 


502 


GATATGAA 


GGCTAGCTACAACGA 


ATTAGGAC 


4001 


3016 


AUAUUCAU 


A 


UCAACAGC 


505 


GCTGTTGA 


GGCTAGCTACAACGA 


ATGAATAT 


4002 


3055 


UAAAAAUU 


A 


UGUGGAAG 


516 


CTTCCACA 


GGCTAGCTACAACGA 


AATTTTTA 


4003 


3149 


UUUUGAUU 


A 


UAAAAUUU 


540 


AAATTTTA 


GGCTAGCTACAACGA 


AATCAAAA 


4004 


3168 


UAAAAUGU 


A 


UUUUAGAC 


547 


GTCTAAAA 


GGCTAGCTACAACGA 


ACATTTTA 


4005 


3194 


GGGGCGAU 


A 


UACUAAAU 


555 


ATTTAGTA 


GGCTAGCTACAACGA 


ATCGCCCC 


4006 


3247 


GGGGCGAU 


A 


UACUAAAU 


555 


ATTTAGTA 


GGCTAGCTACAACGA 


ATCGCCCC 


4006 


3196 


GGCGAUAU 


A 


CUAAAUGU 


556 


ACATTTAG 


GGCTAGCTACAACGA 


ATATCGCC 


4007 


3249 


GGCGAUAU 


A 


CUAAAUGU 


556 


ACATTTAG 


GGCTAGCTACAACGA 


ATATCGCC 


4007 


3205 


CUAAAUGU 


A 


UAUAGUAC 


558 


GTACTATA 


GGCTAGCTACAACGA 


ACATTTAG 


4008 


3207 


AAAUGUAU 


A 


UAGUACAU 


559 


ATGTACTA 


GGCTAGCTACAACGA 


ATACATTT 


4009 


3212 


UAUAUAGU 


A 


CAUUUAUA 


561 


TATAAATG 


GGCTAGCTACAACGA 


ACTATATA 


4010 


3218 


GUACAUUU 


A 


UACUAAAU 


564 


ATTTAGTA 


GGCTAGCTACAACGA 


AAATGTAC 


4011 


3220 


ACAUUUAU 


A 


CUAAAUGU 


565 


ACATTTAG 


GGCTAGCTACAACGA 


ATAAATGT 


4012 


3229 


CUAAAUGU 


A 


UUCCUGUA 


567 


TACAGGAA 


GGCTAGCTACAACGA 


ACATTTAG 


4013 


3258 


CUAAAUGU 


A 


UUUUAGAC 


572 


GTCTAAAA 


GGCTAGCTACAACGA 


ACATTTAG 


4014 


65 


AGGGGAGC 


A 


UGAAGAGG 


579 


CCTCTTCA 


GGCTAGCTACAACGA 


GCTCCCCT 


4015 


93 


UGUCAAGC 


A 


UCUGGCAC 


581 


GTGCCAGA 


GGCTAGCTACAACGA 


GCTTGACA 


4016 


100 


CAUCUGGC 


A 


CAGCUGAA 


583 


TTCAGCTG 


GGCTAGCTACAACGA 


GCCAGATG 


4017 


161 


UUGUUAUC 


A 


UUCUCCUA 


590 


TAGGAGAA 


GGCTAGCTACAACGA 


GATAACAA 


4018 


195 


AGUAAAAC 


A 


CAUCAGGU 


596 


ACCTGATG 


GGCTAGCTACAACGA 


GTTTTACT 


4019 


197 


UAAAACAC 


A 


UCAGGUCA 


597 


TGACCTGA 


GGCTAGCTACAACGA 


GTGTTTTA 


4020 


231 


GAUAAACC 


A 


CUUCCGAU 


603 


ATCGGAAG 


GGCTAGCTACAACGA 


GGTTTATC 


4021 


267 


AUAUUUUC 


A 


UAUCUGUA 


607 


TACAGATA 


GGCTAGCTACAACGA 


GAAAATAT 


4022 


299 


AGAAAGAC 


A 


CCUUCGUA 


609 


TACGAAGG 


GGCTAGCTACAACGA 


GTCTTTCT 


4023 
314


UAACCCGC A UUUUCCAA 


614 


TTGGAAAA GGCTAGCTACAACGA GCGGGTTA 


4024 


334 


GAGGAAUC A CAGGGAGA 


617 


TCTCCCTG GGCTAGCTACAACGA GATTCCTC 


4025 


360 


AUGGGGCC A UUUAAGAG 


622 


CTCTTAAA GGCTAGCTACAACGA GGCCCCAT 


4026 


379 


CUGUGUUC A UCUUGAUU 


624 


AATCAAGA GGCTAGCTACAACGA GAACACAG 


4027 


392 


GAUUCUUC A CCUUCUAG 


627 


CTAGAAGG GGCTAGCTACAACGA GAAGAATC 


4028 


420 


AGUAAUUC A CUCAUUCA 


634 


TGAATGAG GGCTAGCTACAACGA GAATTACT 


4029 


424 


AUUCACUC A UUCAGCUG 


636 


CAGCTGAA GGCTAGCTACAACGA GAGTGAAT 


4030 


454 


AUGAAGGC A UUGUCGUU 


642 


AACGACAA GGCTAGCTACAACGA GCCTTCAT 


4031 


495 


GAUGAAAC A CUCAUUCA 


650 


TGAATGAG GGCTAGCTACAACGA GTTTCATC 


4032 


499 


AAACACUC A UUCAACAA 


652 


TTGTTGAA GGCTAGCTACAACGA GAGTGTTT 


4033 


517 


UAAAGGAC A UGGUGACC 


655 


GGTCACCA GGCTAGCTACAACGA GTCCTTTA 


4034 


531 


ACCCAGGC A UCUCUGUA 


659 


TACAGAGA GGCTAGCTACAACGA GCCTGGGT 


4035 


586 


AUGUUGCC A UUUUGAUU 


667 


AATCAAAA GGCTAGCTACAACGA GGCAACAT 


4036 


603 


CCUGAAAC A UGGAAGAC 


670 


GTCTTCCA GGCTAGCTACAACGA GTTTCAGG 


4037 


706 


AACCCUAC A CUGAGCAG 


692 


CTGCTCAG GGCTAGCTACAACGA GTAGGGTT 


4038 


749 


AAGGAUCC A CCUCACUC 


698 


GAGTGAGG GGCTAGCTACAACGA GGATCCTT 


4039 


754 


UCCACCUC A CUCCUGAU 


701 


ATCAGGAG GGCTAGCTACAACGA GAGGTGGA 


4040 


766 


CUGAUUUC A UUGCAGGA 


705 


TCCTGCAA GGCTAGCTACAACGA GAAATCAG 


4041 


798 


UAUGGACC A CAAGGUAA 


709 


TTACCTTG GGCTAGCTACAACGA GGTCCATA 


4042 


810 


GGUAAGGC A UUUGUCCA 


711 


TGGACAAA GGCTAGCTACAACGA GCCTTACC 


4043 


818 


AUUUGUCC A UGAGUGGG 


713 


CCCACTCA GGCTAGCTACAACGA GGACAAAT 


4044 


830 


GUGGGCUC A UCUACGAU 


715 


ATCGTAGA GGCTAGCTACAACGA GAGCCCAC 


4045 


970 


GCUGUUAC A CCAAAAGA 


731 


TCTTTTGG GGCTAGCTACAACGA GTAACAGC 


4046 


982 


AAAGAUGC A CAUUCAAU 


734 


ATTGAATG GGCTAGCTACAACGA GCATCTTT 


4047 


984 


AGAUGCAC A UUCAAUAA 


735 


TTATTGAA GGCTAGCTACAACGA GTGCATCT 


4048 


1071 


AUGUUUGC A CAACAUGU 


749 


ACATGTTG GGCTAGCTACAACGA GCAAACAT 


4049 


1076 


UGCACAAC A UGUUGAUU 


751 


AATCAACA GGCTAGCTACAACGA GTTGTGCA 


4050 


1115 


ACAAAACC A CAACAAAG 


757 


CTTTGTTG GGCTAGCTACAACGA GGTTTTGT 


4051 


1165 


UCCGAAGC A CAUGGGAA 


769 


TTCCCATG GGCTAGCTACAACGA GCTTCGGA 


4052 


1167 


CGAAGCAC A UGGGAAGU 


770 


ACTTCCCA GGCTAGCTACAACGA GTGCTTCG 


4053 


1207 


AGAAAACC A CUCCUAUG 


775 


CATAGGAG GGCTAGCTACAACGA GGTTTTCT 


4054 


1221 


AUGACAAC A CAGCCACC 


780 


GGTGGCTG GGCTAGCTACAACGA GTTGTCAT 


4055 


1227 


ACACAGCC A CCAAAUCC 


783 


GGATTTGG GGCTAGCTACAACGA GGCTGTGT 


4056 


1237 


CAAAUCCC A CCUUCUCA 


788 


TGAGAAGG GGCTAGCTACAACGA GGGATTTG 


4057 


1245 


ACCUUCUC A UUGCUGCA 


792 


TGCAGCAA GGCTAGCTACAACGA GAGAAGGT 


4058 


1300 


CUGGAAGC A UGGCGACU 


800 


AGTCGCCA GGCTAGCTACAACGA GCTTCCAG 


4059 


1395 


AUGGUGAC A UUUGACAG 


820 


CTGTCAAA GGCTAGCTACAACGA GTCACCAT 


4060 


1412 


UGCUGCCC A UGUACAAA 


825 


TTTGTACA GGCTAGCTACAACGA GGGCAGCA 


4061 


1429 


GUGAACUC A UACAGAUA 


828 


TATCTGTA GGCTAGCTACAACGA GAGTTCAC 


4062 


1459 


ACAGGGAC A CACUCGCC 


833 


GGCGAGTG GGCTAGCTACAACGA GTCCCTGT 


4063 


1461 


AGGGACAC A CUCGCCAA 


834 


TTGGCGAG GGCTAGCTACAACGA GTGTCCCT 


4064 


1504 


GGACGUCC A UCUGCAGC 


845 


GCTGCAGA GGCTAGCTACAACGA GGACGTCC 


4065 


1527 


CGAUCGGC A UUUACUGU 


849 


ACAGTAAA GGCTAGCTACAACGA GCCGATCG 


4066 


1600 


AAGACAAC A CUAUAAGU 


858 


ACTTATAG GGCTAGCTACAACGA GTTGTCTT 


4067 


1642 


GUGGUGCC A UCAUCCAC 


864 


GTGGATGA GGCTAGCTACAACGA GGCACCAC 


4068 


1645 


GUGCCAUC A UCCACACA 


865 


TGTGTGGA GGCTAGCTACAACGA GATGGCAC 


4069 


1649 


CAUCAUCC A CACAGUCG 


867 


CGACTGTG GGCTAGCTACAACGA GGATGATG 


4070 


1651 


UCAUCCAC A CAGUCGCU 


868 


AGCGACTG GGCTAGCTACAACGA GTGGATGA 


4071 


1722 


UUACAGAC A UAUGCUUC 


884 


GAAGCATA GGCTAGCTACAACGA GTCTGTAA 


4072 


1756 


AUGGCCUC A UUGAUGCU 


892 


AGCATCAA GGCTAGCTACAACGA GAGGCCAT 


4073 


1779 


GCCCUUUC A UCAGGAAA 


897 


TTTCCTGA GGCTAGCTACAACGA GAAAGGGC 


4074 


1810 


AGCGCUCC A UCCAGCUU 


905 


AAGCTGGA GGCTAGCTACAACGA GGAGCGCT 


4075 


1864 


UGAAUGGC A CAGUGAUC 


917 


GATCACTG GGCTAGCTACAACGA GCCATTCA 


4076 


1882 


UGGACAGC A CCGUGGGA 


920 


TCCCACGG GGCTAGCTACAACGA GCTGTCCA 


4077 


1897 


GAAAGGAC A CUUUGUUU 


922 


AAACAAAG GGCTAGCTACAACGA GTCCTTTC 


4078 


1912 


UUCUUAUC A CCUGGACA 


925 


TGTCCAGG GGCTAGCTACAACGA GATAAGAA 


4079 
1993


ACAAAAAC 


A 


CCAAAAUG 


947 


CATTTTGG 


GGCTAGCTACAACGA 


GTTTTTGT 


4080 


2023 


UCCCAGGC 


A 


UUGCUAAG 


959 


CTTAGCAA 


GGCTAGCTACAACGA 


GCCTGGGA 


4081 


2038 


AGGUUGGC 


A 


CUUGGAAA 


961 


TTTCCAAG 


GGCTAGCTACAACGA 


GCCAACCT 


4082 


2067 


GCAAGCUC 


A 


CAAACCUU 


968 


AAGGTTTG 


GGCTAGCTACAACGA 


GAGCTTGC 


4083 


2089 


UGACUGUC 


A 


CGUCCCGU 


976 


ACGGGACG 


GGCTAGCTACAACGA 


GACAGTCA 


4084 


2152 


ACAAGGAC 


A 


CCAGCAAA 


994 


TTTGCTGG 


GGCTAGCTACAACGA 


GTCCTTGT 


4085 


2230 


CCAGUGUC 


A 


CAGCCCUG 


1019 


CAGGGCTG 


GGCTAGCTACAACGA 


GACACTGG 


4086 


2338 


GGUAUUUC 


A 


CAACUUAU 


1037 


ATAAGTTG 


GGCTAGCTACAACGA 


GAAATACC 


4087 


2350 


CUUAUGAC 


A 


CGAAUGGU 


1040 


ACCATTCG 


GGCTAGCTACAACGA 


GTCATAAG 


4088 


2436 


AGUGGAGC 


A 


CUGUACAU 


1052 


ATGTACAG 


GGCTAGCTACAACGA 


GCTCCACT 


4089 


2443 


CACUGUAC 


A 


UACCUGGC 


1054 


GCCAGGTA 


GGCTAGCTACAACGA 


GTACAGTG 


4090 


2484


UGGAAUCC 


A 


CCAAGACC 


1060 


GGTCTTGG 


GGCTAGCTACAACGA 


GGATTCCA 


4091 


2519 


UGUUCAAC 


A 


CAAGCAAG 


1066 


CTTGCTTG 


GGCTAGCTACAACGA 


GTTGAACA 


4092 


2544 


AGCAGAAC 


A 


UCCUCGGG 


1071 


CCCGAGGA 


GGCTAGCTACAACGA 


GTTCTGCT 


4093 


2559 


GGAGGCUC 


A 


UUUGUGGC 


1075 


GCCACAAA 


GGCTAGCTACAACGA 


GAGCCTCC 


4094 


2590 


AUGCUCCC 


A 


UACCUGAU 


1084 


ATCAGGTA 


GGCTAGCTACAACGA 


GGGAGCAT 


4095 


2607 


CUCUUCCC


A 


CCUGGCCA 


1091 


TGGCCAGG 


GGCTAGCTACAACGA 


GGGAAGAG 


4096 


2620 


GCCAAAUC 


A 


CCGACCUG 


1096 


CAGGTCGG 


GGCTAGCTACAACGA 


GATTTGGC 


4097 


2642 


GGAAAUUC 


A 


CGGGGGCA 


1100 


TGCCCCCG 


GGCTAGCTACAACGA 


GAATTTCC 


4098 


2656 


GCAGUCUC 


A 


UUAAUCUG 


1103 


CAGATTAA 


GGCTAGCTACAACGA 


GAGACTGC 


4099 


2696 


UUAUGACC 


A 


UGGAACAG 


1111 


CTGTTCCA 


GGCTAGCTACAACGA 


GGTCATAA 


4100 


2708 


AACAGCUC 


A 


CAAGUAUA 


1114 


TATACTTG 


GGCTAGCTACAACGA 


GAGCTGTT 


4101 


2719 


AGUAUAUC 


A 


UUCGAAUA 


1116 


TATTCGAA 


GGCTAGCTACAACGA 


GATATACT 


4102 


2794 


CUGCUCUC 


A 


UCCCAAAG 


1130 


CTTTGGGA 


GGCTAGCTACAACGA 


GAGAGCAG


4103 


2845 


CAGAAAAC 


A 


UUACUUUU 


1141 


AAAAGTAA 


GGCTAGCTACAACGA 


GTTTTCTG 


4104 


2863 


AAAAUGGC 


A 


CAGAUCUU 


1143 


AAGATCTG 


GGCTAGCTACAACGA 


GCCATTTT 


4105 


2875 


AUCUUUUC 


A 


UUGCUAUU 


1146 


AATAGCAA 


GGCTAGCTACAACGA 


GAAAAGAT 


4106 


2926 


UAUCCAAC 


A 


UUGCACGA 


1154 


TCGTGCAA 


GGCTAGCTACAACGA 


GTTGGATA 


4107 


2931 


AACAUUGC 


A 


CGAGUAUC 


1155 


GATACTCG 


GGCTAGCTACAACGA 


GCAATGTT 


4108 


2955 


AUUCCUCC 


A 


CAGACUCC 


1160 


GGAGTCTG 


GGCTAGCTACAACGA 


GGAGGAAT 


4109 


2973 


CCAGAGAC 


A 


CCUAGUCC 


1166 


GGACTAGG 


GGCTAGCTACAACGA 


GTCTCTGG 


4110 


3014 


UAAUAUUC 


A 


UAUCAACA 


1177 


TGTTGATA 


GGCTAGCTACAACGA 


GAATATTA 


4111 


3025 


UCAACAGC 


A 


CCAUUCCU 


1180 


AGGAATGG 


GGCTAGCTACAACGA 


GCTGTTGA 


4112 


3028 


ACAGCACC 


A 


UUCCUGGC 


1182 


GCCAGGAA 


GGCTAGCTACAACGA 


GGTGCTGT 


4113 


3037 


UUCCUGGC 


A 


UUCACAUU 


1185 


AATGTGAA 


GGCTAGCTACAACGA 


GCCAGGAA 


4114 


3041 


UGGCAUUC 


A 


CAUUUUAA 


1186 


TTAAAATG 


GGCTAGCTACAACGA 


GAATGCCA 


4115 


3043 


GCAUUCAC 


A 


UUUUAAAA 


1187 


TTTTAAAA 


GGCTAGCTACAACGA 


GTGAATGC 


4116 


3130 


AAUAAAUC 


A 


UUCAUCCU 


1196 


AGGATGAA 


GGCTAGCTACAACGA 


GATTTATT 


4117 


3134 


AAUCAUUC 


A 


UCCUUUUU 


1197 


AAAAAGGA 


GGCTAGCTACAACGA 


GAATGATT 


4118 


3214 


UAUAGUAC 


A 


UUUAUACU 


1205 


AGTATAAA 


GGCTAGCTACAACGA 


GTACTATA 


4119 


134 


ACAAGUAC 


G 


CAAUUUGA 


1215 


TCAAATTG 


GGCTAGCTACAACGA 


GTACTTGT 


4120 


312 


CGUAACCC 


G 


CAUUUUCC 


1220 


GGAAAATG 


GGCTAGCTACAACGA 


GGGTTACG 


4121 


463 


UUGUCGUU 


G 


CAAUCGAC 


1225 


GTCGATTG 


GGCTAGCTACAACGA 


AACGACAA 


4122 


480 


CCCAAUGU 


G 


CCAGAAGA 


1227 


TCTTCTGG 


GGCTAGCTACAACGA 


ACATTGGG 


4123 


583 


AAAAUGUU 


G 


CCAUUUUG 


1232 


CAAAATGG 


GGCTAGCTACAACGA 


AACATTTT 


4124 


655 


ACAAAAAU 


G 


CUGAUGUU 


1238 


AACATCAG 


GGCTAGCTACAACGA 


ATTTTTGT 


4125 


670 


UUCUGGUU 


G 


CUGAGUCU 


1240 


AGACTCAG 


GGCTAGCTACAACGA 


AACCAGAA 


4126 


769 


AUUUCAUU 


G 


CAGGAAAA 


1247 


TTTTCCTG 


GGCTAGCTACAACGA 


AATGAAAT 


4127 


980 


CAAAAGAU 


G 


CACAUUCA 


1255 


TGAATGTG 


GGCTAGCTACAACGA 


ATCTTTTG 


4128 


1040 


CCAAUCCC 


G 


CCAGACGG 


1258 


CCGTCTGG 


GGCTAGCTACAACGA 


GGGATTGG 


4129 


1069 


UAAUGUUU 


G 


CACAACAU 


1259 


ATGTTGTG 


GGCTAGCTACAACGA 


AAACATTA 


4130 


1151 


UCAAAAAU 


G 


CAAUCUCC 


1262 


GGAGATTG 


GGCTAGCTACAACGA 


ATTTTTGA 


4131 


1248


UUCUCAUU 


G 


CUGCAGAU 


1268


ATCTGCAG 


GGCTAGCTACAACGA 


AATGAGAA 


4132 


1251 


UCAUUGCU 


G 


CAGAUUGG 


1269 


CCAATCTG 


GGCTAGCTACAACGA 


AGCAATGA 


4133 


1316 


UGGUAACC 


G 


CCUCAAUC 


1272 


GATTGAGG 


GGCTAGCTACAACGA 


GGTTACCA 


4134 


1353 


CUUUUCCU


G 


CUGCAGAC 


1275 


GTCTGCAG 


GGCTAGCTACAACGA 


AGGAAAAG 


4135 
1356


UUCCUGCU 


G 


CAGACAGU 


1276 


ACTGTCTG 


GGCTAGCTACAACGA 


AGCAGGAA 


4136 


1405 


UUGACAGU 


G 


CUGCCCAU 


1280 


ATGGGCAG 


GGCTAGCTACAACGA 


ACTGTCAA 


4137 


1408 


ACAGUGCU 


G 


CCCAUGUA 


1281 


TACATGGG 


GGCTAGCTACAACGA 


AGCACTGT 


4138 


1465 


ACACACUC 


G 


CCAAAAGA 


1284 


TCTTTTGG 


GGCTAGCTACAACGA 


GAGTGTGT 


4139 


1480 


GAUUACCU 


G 


CAGCAGCU 


1285 


AGCTGCTG 


GGCTAGCTACAACGA 


AGGTAATC 


4140 


1508 


GUCCAUCU 


G 


CAGCGGGC 


1286 


GCCCGCTG 


GGCTAGCTACAACGA 


AGATGGAC 


4141 


1575 


GAAAUUGU 


G 


CUGCUGAC 


1291 


GTCAGCAG 


GGCTAGCTACAACGA 


ACAATTTC 


4142 


1578 


AUUGUGCU 


G 


CUGACGGA 


1292 


TCCGTCAG 


GGCTAGCTACAACGA 


AGCACAAT 


4143 


1613 


AAGUGGGU 


G 


CUUUAACG 


1294 


CGTTAAAG 


GGCTAGCTACAACGA 


ACCCACTT 


4144 


1639 


AAAGUGGU 


G 


CCAUCAUC 


1296 


GATGATGG 


GGCTAGCTACAACGA 


ACCACTTT 


4145 


1657 


ACACAGUC 


G 


CUUUGGGG 


1297 


CCCCAAAG 


GGCTAGCTACAACGA 


GACTGTGT 


4146 


1672 


GGCCCUCU 


G 


CAGCUCAA 


1298 


TTGAGCTG 


GGCTAGCTACAACGA 


AGAGGGCC 


4147 


1726 


AGACAUAU 


G 


CUUCAGAU 


1300 


ATCTGAAG 


GGCTAGCTACAACGA 


ATATGTCT 


4148 


1762 


UCAUUGAU 


G 


CUUUUGGG 


1302 


CCCAAAAG 


GGCTAGCTACAACGA 


ATCAATGA 


4149 


1805 


CUCUCAGC 


G 


CUCCAUCC 


1303 


GGATGGAG 


GGCTAGCTACAACGA 


GCTGAGAG 


4150 


1923 


UGGACAAC 


G 


CAGCCUCC 


1307 


GGAGGCTG 


GGCTAGCTACAACGA 


GTTGTCCA 


4151 


2026 


CAGGCAUU 


G 


CUAAGGUU 


1308 


AACCTTAG 


GGCTAGCTACAACGA 


AATGCCTG 


4152 


2055 


UACAGUCU 


G 


CAAGCAAG 


1309 


CTTGCTTG 


GGCTAGCTACAACGA 


AGACTGTA 


4153 


2098 


CGUCCCGU 


G 


CGUCCAAU 


1312 


ATTGGACG 


GGCTAGCTACAACGA 


ACGGGACG 


4154 


2107 


CGUCCAAU 


G 


CUACCCUG 


1313 


CAGGGTAG 


GGCTAGCTACAACGA 


ATTGGACG 


4155 


2115 


GCUACCCU 


G 


CCUCCAAU 


1314 


ATTGGAGG 


GGCTAGCTACAACGA 


AGGGTAGC 


4156 


2185 


UAGUUUAU 


G 


CAAAUAUU 


1317 


AATATTTG 


GGCTAGCTACAACGA 


ATAAACTA 


4157 


2195 


AAAUAUUC 


G 


CCAAGGAG 


1318 


CTCCTTGG 


GGCTAGCTACAACGA 


GAATATTT 


4158 


2296 


GAGCAGGU 


G 


CUGAUGCU 


1322 


AGCATCAG 


GGCTAGCTACAACGA 


ACCTGCTC 


4159 


2302 


GUGCUGAU 


G 


CUACUAAG 


1324 


CTTAGTAG 


GGCTAGCTACAACGA 


ATCAGCAC 


4160 


2376 


GUAAAAGU 


G 


CGGGCUCU 


1328 


AGAGCCCG 


GGCTAGCTACAACGA 


ACTTTTAC 


4161 


2398 


GAGUUAAC 


G 


CAGCCAGA 


1329 


TCTGGCTG 


GGCTAGCTACAACGA 


GTTAACTC 


4162 


2584 


UCCCAAAU 


G 


CUCCCAUA 


1337 


TATGGGAG 


GGCTAGCTACAACGA 


ATTTGGGA 


4163 


2788 


AUACUACU 


G 


CUCUCAUC 


1348 


GATGAGAG 


GGCTAGCTACAACGA 


AGTAGTAT 


4164 


2878 


UUUUCAUU 


G 


CUAUUCAG 


1351 


CTGAATAG 


GGCTAGCTACAACGA 


AATGAAAA 


4165 


2929 


CCAACAUU 


G 


CACGAGUA 


1355 


TACTCGTG 


GGCTAGCTACAACGA 


AATGTTGG 


4166 


2964 


CAGACUCC 


G 


CCAGAGAC 


1357 


GTCTCTGG 


GGCTAGCTACAACGA 


GGAGTCTG 


4167 


2995 


AAACGUCU 


G 


CUCCUUGU 


1360 


ACAAGGAG 


GGCTAGCTACAACGA 


AGACGTTT 


4168 


3078 


GGAGAACU 


G 


CAGCUGUC 


1361 


GACAGCTG 


GGCTAGCTACAACGA 


AGTTCTCC 


4169 


3294 


AAUAAAAU 


G 


CUAAACAA 


1366 


TTGTTTAG 


GGCTAGCTACAACGA 


ATTTTATT 


4170 


27 


AAAUGGAU 


G 


UGGAAUAU 


1367 


ATATTCCA 


GGCTAGCTACAACGA 


ATCCATTT 


4171 


52 


AUUUUCUU 


G 


UUUAAGGG 


1368 


CCCTTAAA 


GGCTAGCTACAACGA 


AAGAAAAT 


4172 


75 


GAAGAGGU 


G 


UUGAGGUU 


1369 


AACCTCAA 


GGCTAGCTACAACGA 


ACCTCTTC 


4173 


86 


GAGGUUAU 


G 


UCAAGCAU 


1370 


ATGCTTGA 


GGCTAGCTACAACGA 


ATAACCTC 


4174 


155 


AAGAUAUU 


G 


UUAUCAUU 


1371 


AATGATAA 


GGCTAGCTACAACGA 


AATATCTT 


4175 


221 


AAAGACCU 


G 


UGAUAAAC 


1372 


GTTTATCA 


GGCTAGCTACAACGA 


AGGTCTTT 


4176 


253 


GGAAACGU 


G 


UGUCUAUA 


1373 


TATAGACA 


GGCTAGCTACAACGA 


ACGTTTCC 


4177 


255 


AAACGUGU 


G 


UCUAUAUU 


1374 


AATATAGA 


GGCTAGCTACAACGA 


ACACGTTT 


4178 


273 


UCAUAUCU 


G 


UAUAUAUA 


1375 


TATATATA 


GGCTAGCTACAACGA 


AGATATGA 


4179 


344 


AGGGAGAU 


G 


UACAGCAA 


1376 


TTGCTGTA 


GGCTAGCTACAACGA 


ATCTCCCT 


4180 


373 


AGAGUUCU 


G 


UGUUCAUC 


1377 


GATGAACA 


GGCTAGCTACAACGA 


AGAACTCT 


4181 


375 


AGUUCUGU 


G 


UUCAUCUU 


1378 


AAGATGAA 


GGCTAGCTACAACGA 


ACAGAACT 


4182 


457 


AAGGCAUU 


G 


UCGUUGCA 


1379 


TGCAACGA 


GGCTAGCTACAACGA 


AATGCCTT 


4183 


478 


ACCCCAAU 


G 


UGCCAGAA 


1380 


TTCTGGCA 


GGCTAGCTACAACGA 


ATTGGGGT 


4184 


537 


GCAUCUCU 


G 


UAUCUGUU 


1381 


AACAGATA 


GGCTAGCTACAACGA 


AGAGATGC 


4185 


543 


CUGUAUCU 


G 


UUUGAAGC 


1382 


GCTTCAAA 


GGCTAGCTACAACGA 


AGATACAG 


4186 


580 


UCAAAAAU 


G 


UUGCCAUU 


1383 


AATGGCAA 


GGCTAGCTACAACGA 


ATTTTTGA 


4187 


625


CUGACUAU 


G 


UGAGACCA 


1384


TGGTCTCA 


GGCTAGCTACAACGA 


ATAGTCAG 


4188 


661 


AUGCUGAU 


G 


UUCUGGUU 


1385 


AACCAGAA 


GGCTAGCTACAACGA 


ATCAGCAT 


4189 


725 


GGGCAACU 


G 


UGGAGAGA 


1386 


TCTCTCCA 


GGCTAGCTACAACGA 


AGTTGCCC 


4190 


814 


AGGCAUUU 


G 


UCCAUGAG 


1387 


CTCATGGA 


GGCTAGCTACAACGA 


AAATGCCT 


4191 



911 


AGUAAGAU G UUCAGCAG 


1388 


CTGCTGAA GGCTAGCTACAACGA ATCTTACT 


4192 


937 


GUACAAAU G UAGUAAAG 


1389 


CTTTACTA GGCTAGCTACAACGA ATTTGTAC 


4193 


950 


AAAGAAGU G UCAGGGAG 


1390 


CTCCCTGA GGCTAGCTACAACGA ACTTCTTT 


4194 


965 


AGGCAGCU G UUACACCA 


1391 


TGGTGTAA GGCTAGCTACAACGA AGCTGCCT 


4195 


1019 


AAAAGGAU G UGAGUUUG 


1392 


CAAACTCA GGCTAGCTACAACGA ATCCTTTT 


4196 


1027 


GUGAGUUU G UUCUCCAA 


1393 


TTGGAGAA GGCTAGCTACAACGA AAACTCAC 


4197 


1065 


UCUAUAAU G UUUGCACA 


1394 


TGTGCAAA GGCTAGCTACAACGA ATTATAGA 


4198 


1078 


CACAACAU G UUGAUUCU 


1395 


AGAATCAA GGCTAGCTACAACGA ATGTTGTG 


4199 


1100 


UGAAUUCU G UACAGAAC 


1396 


GTTCTGTA GGCTAGCTACAACGA AGAATTCA 


4200 


1270 


AAAGAAUU G UGUGUUUA 


1397 


TAAACACA GGCTAGCTACAACGA AATTCTTT 


4201 


1272 


AGAAUUGU G UGUUUAGU 


1398 


ACTAAACA GGCTAGCTACAACGA ACAATTCT 


4202 


1274 


AAUUGUGU G UUUAGUCC 


1399 


GGACTAAA GGCTAGCTACAACGA ACACAATT 


4203 


1414 


CUGCCCAU G UACAAAGU 


1400 


ACTTTGTA GGCTAGCTACAACGA ATGGGCAG 


4204 


1534 


CAUUUACU G UGAUUAGG 


1401 


CCTAATCA GGCTAGCTACAACGA AGTAAATG 


4205 


1573 


CUGAAAUU G UGCUGCUG 


1402 


CAGCAGCA GGCTAGCTACAACGA AATTTCAG 


4206 


1695 


GAGGAGCU G UCCAAAAU 


1403 


ATTTTGGA GGCTAGCTACAACGA AGCTCCTC 


4207 


1795 


AUGGAGCU G UCUCUCAG 


1404 


CTGAGAGA GGCTAGCTACAACGA AGCTCCAT 


4208 


1902 


GACACUUU G UUUCUUAU 


1405 


ATAAGAAA GGCTAGCTACAACGA AAAGTGTC 


4209 


1978 


GUGGCUUU G UAGUGGAC 


1406 


GTCCACTA GGCTAGCTACAACGA AAAGCCAC 


4210 


2086 


CCCUGACU G UCACGUCC 


1407 


GGACGTGA GGCTAGCTACAACGA AGTCAGGG 


4211 


2227 


GGGCCAGU G UCACAGCC 


1408 


GGCTGTGA GGCTAGCTACAACGA ACTGGCCC 


4212 


2320 


AUGACGGU G UCUACUCA 


1409 


TGAGTAGA GGCTAGCTACAACGA ACCGTCAT 


4213 


2368 


GAUACAGU G UAAAAGUG 


1410 


CACTTTTA GGCTAGCTACAACGA ACTGTATC 


4214 


2439 


GGAGCACU G UACAUACC 


1411 


GGTATGTA GGCTAGCTACAACGA AGTGCTCC 


4215 


2512 


AGGAUGAU G UUCAACAC 


1412 


GTGTTGAA GGCTAGCTACAACGA ATCATCCT 


4216 


2529 


AAGCAAGU G UGUUUCAG 


1413 


CTGAAACA GGCTAGCTACAACGA ACTTGCTT 


4217 


2531 


GCAAGUGU G UUUCAGCA 


1414 


TGCTGAAA GGCTAGCTACAACGA ACACTTGC 


4218 


2563 


GCUCAUUU G UGGCUUCU 


1415 


AGAAGCCA GGCTAGCTACAACGA AAATGAGC 


4219 


2575 


CUUCUGAU G UCCCAAAU 


1416 


ATTTGGGA GGCTAGCTACAACGA ATCAGAAG 


4220 


2829 


GUCUUUUU G UUUAAACC 


1417 


GGTTTAAA GGCTAGCTACAACGA AAAAAGAC 


4221 


2890 


UUCAGGCU G UUGAUAAG 


1418 


CTTATCAA GGCTAGCTACAACGA AGCCTGAA 


4222 


2943 


GUAUCUUU G UUUAUUCC 


1419 


GGAATAAA GGCTAGCTACAACGA AAAGATAC 


4223 


3002 


UGCUCCUU G UCCUAAUA 


1420 


TATTAGGA GGCTAGCTACAACGA AAGGAGCA 


4224 


3057 


AAAAUUAU G UGGAAGUG 


1421 


CACTTCCA GGCTAGCTACAACGA ATAATTTT 


4225 


3084 


CUGCAGCU G UCAAUAGC 


1422 


GCTATTGA GGCTAGCTACAACGA AGCTGCAG 


4226 


3109 


GAAUUUUU G UCAGAUAA 


1423 


TTATCTGA GGCTAGCTACAACGA AAAAATTC 


4227 


3166 


UCUAAAAU G UAUUUUAG 


1424 


CTAAAATA GGCTAGCTACAACGA ATTTTAGA 


4228 


3182 


GACUUCCU G UAGGGGGC 


1425 


GCCCCCTA GGCTAGCTACAACGA AGGAAGTC 


4229 


3272 


GACUUCCU G UAGGGGGC 


1425 


GCCCCCTA GGCTAGCTACAACGA AGGAAGTC 


4229 


3203 


UACUAAAU G UAUAUAGU 


1426 


ACTATATA GGCTAGCTACAACGA ATTTAGTA 


4230 


3227 


UACUAAAU G UAUUCCUG 


1427 


CAGGAATA GGCTAGCTACAACGA ATTTAGTA 


4231 


3235 


GUAUUCCU G UAGGGGGC 


1428 


GCCCCCTA GGCTAGCTACAACGA AGGAATAC 


4232 


3256 


UACUAAAU G UAUUUUAG 


1429 


CTAAAATA GGCTAGCTACAACGA ATTTAGTA 


4233 


15 


UGCUUUUG G UACAAAUG 


1430 


CATTTGTA GGCTAGCTACAACGA CAAAAGCA 


4234 


63 


UAAGGGGA G CAUGAAGA 


1431 


TCTTCATG GGCTAGCTACAACGA TCCCCTTA 


4235 


73 


AUGAAGAG G UGUUGAGG 


1432 


CCTCAACA GGCTAGCTACAACGA CTCTTCAT 


4236 


81 


GUGUUGAG G UUAUGUCA 


1433 


TGACATAA GGCTAGCTACAACGA CTCAACAC 


4237 


91 


UAUGUCAA G CAUCUGGC 


1434 


GCCAGATG GGCTAGCTACAACGA TTGACATA 


4238 


98 


AGCAUCUG G CACAGCUG 


1435 


CAGCTGTG GGCTAGCTACAACGA CAGATGCT 


4239 


103 


CUGGCACA G CUGAAGGC 


1436 


GCCTTCAG GGCTAGCTACAACGA TGTGCCAG 


4240 


110 


AGCUGAAG G CAGAUGGA 


1437 


TCCATCTG GGCTAGCTACAACGA CTTCAGCT 


4241 


130 


AUUUACAA G UACGCAAU 


1438 


ATTGCGTA GGCTAGCTACAACGA TTGTAAAT 


4242 


182 


AGACAAGA G CAAUAGUA 


1439 


TACTATTG GGCTAGCTACAACGA TCTTGTCT 


4243 


188 


GAGCAAUA G UAAAACAC 


1440 


GTGTTTTA GGCTAGCTACAACGA TATTGCTC 


4244 


202 


CACAUCAG G UCAGGGGG 


1441 


CCCCCTGA GGCTAGCTACAACGA CTGATGTG 


4245 


210 


GUCAGGGG G UUAAAGAC 


1442 


GTCTTTAA GGCTAGCTACAACGA CCCCTGAC 


4246 
242


UCCGAUAA G UUGGAAAC 


1443 


GTTTCCAA GGCTAGCTACAACGA TTATCGGA 


4247 


251 


UUGGAAAC G UGUGUCUA 


1444 


TAGACACA GGCTAGCTACAACGA GTTTCCAA 


4248 


287 


AUAUAAUG G UAAAGAAA 


1445 


TTTCTTTA GGCTAGCTACAACGA CATTATAT 


4249 


305 


ACACCUUC G UAACCCGC 


1446 


GCGGGTTA GGCTAGCTACAACGA GAAGGTGT 


4250 


349 


GAUGUACA G CAAUGGGG 


1447 


CCCCATTG GGCTAGCTACAACGA TGTACATC 


4251 


357 


GCAAUGGG G CCAUUUAA 


1448 


TTAAATGG GGCTAGCTACAACGA CCCATTGC 


4252 


368 


AUUUAAGA G UUCUGUGU 


1449 


ACACAGAA GGCTAGCTACAACGA TCTTAAAT 


4253 


406 


UAGAAGGG G CCCUGAGU 


1450 


ACTCAGGG GGCTAGCTACAACGA CCCTTCTA 


4254 


413 


GGCCCUGA G UAAUUCAC 


1451 


GTGAATTA GGCTAGCTACAACGA TCAGGGCC 


4255 


429 


CUCAUUCA G CUGAACAA 


1452 


TTGTTCAG GGCTAGCTACAACGA TGAATGAG 


4256 


443 


CAACAAUG G CUAUGAAG 


1453 


CTTCATAG GGCTAGCTACAACGA CATTGTTG 


4257 


452 


CUAUGAAG G CAUUGUCG 


1454 


CGACAATG GGCTAGCTACAACGA CTTCATAG 


4258 


460 


GCAUUGUC G UUGCAAUC 


1455 


GATTGCAA GGCTAGCTACAACGA GACAATGC 


4259 


520 


AGGACAUG G UGACCCAG 


1456 


CTGGGTCA GGCTAGCTACAACGA CATGTCCT 


4260 


529 


UGACCCAG G CAUCUCUG 


1457 


CAGAGATG GGCTAGCTACAACGA CTGGGTCA 


4261 


550 


UGUUUGAA G CUACAGGA 


1458 


TCCTGTAG GGCTAGCTACAACGA TTCAAACA 


4262 


561 


ACAGGAAA G CGAUUUUA 


1459 


TAAAATCG GGCTAGCTACAACGA TTTCCTGT 


4263 


616 


AGACAAAG G CUGACUAU 


1460 


ATAGTCAG GGCTAGCTACAACGA CTTTGTCT 


4264 


667 


AUGUUCUG G UUGCUGAG 


1461 


CTCAGCAA GGCTAGCTACAACGA CAGAACAT 


4265 


675 


GUUGCUGA G UCUACUCC 


1462 


GGAGTAGA GGCTAGCTACAACGA TCAGCAAC 


4266 


689 


UCCUCCAG G UAAUGAUG 


1463 


CATCATTA GGCTAGCTACAACGA CTGGAGGA 


4267 


711 


UACACUGA G CAGAUGGG 


1464 


CCCATCTG GGCTAGCTACAACGA TCAGTGTA 


4268 


719 


GCAGAUGG G CAACUGUG 


1465 


CACAGTTG GGCTAGCTACAACGA CCATCTGC 


4269 


737 


AGAGAAGG G UGAAAGGA 


1466 


TCCTTTCA GGCTAGCTACAACGA CCTTCTCT 


4270 


780 


GGAAAAAA G UUAGCUGA 


1467 


TCAGCTAA GGCTAGCTACAACGA TTTTTTCC 


4271 


784 


AAAAGUUA G CUGAAUAU 


1468 


ATATTCAG GGCTAGCTACAACGA TAACTTTT 


4272 


803 


ACCACAAG G UAAGGCAU 


1469 


ATGCCTTA GGCTAGCTACAACGA CTTGTGGT 


4273 


808 


AAGGUAAG G CAUUUGUC 


1470 


GACAAATG GGCTAGCTACAACGA CTTACCTT 


4274 


822 


GUCCAUGA G UGGGCUCA 


1471 


TGAGCCCA GGCTAGCTACAACGA TCATGGAC 


4275 


826 


AUGAGUGG G CUCAUCUA 


1472 


TAGATGAG GGCTAGCTACAACGA CCACTCAT 


4276 


844 


GAUGGGGA G UAUUUGAC 


1473 


GTCAAATA GGCTAGCTACAACGA TCCCCATC 


4277 


855 


UUUGACGA G UACAAUAA 


1474 


TTATTGTA GGCTAGCTACAACGA TCGTCAAA 


4278 


901 


GAAUACAA G CAGUAAGA 


1475 


TCTTACTG GGCTAGCTACAACGA TTGTATTC 


4279 


904 


UACAAGCA G UAAGAUGU 


1476 


ACATCTTA GGCTAGCTACAACGA TGCTTGTA 


4280 


916 


GAUGUUCA G CAGGUAUU 


1477 


AATACCTG GGCTAGCTACAACGA TGAACATC 


4281 


920 


UUCAGCAG G UAUUACUG 


1478 


CAGTAATA GGCTAGCTACAACGA CTGCTGAA 


4282 


929


UAUUACUG G UACAAAUG 


1479 


CATTTGTA GGCTAGCTACAACGA CAGTAATA 


4283 


940 


CAAAUGUA G UAAAGAAG 


1480 


CTTCTTTA GGCTAGCTACAACGA TACATTTG 


4284 


948 


GUAAAGAA G UGUCAGGG 


1481 


CCCTGACA GGCTAGCTACAACGA TTCTTTAC 


4285 


959 


UCAGGGAG G CAGCUGUU 


1482 


AACAGCTG GGCTAGCTACAACGA CTCCCTGA 


4286 


962 


GGGAGGCA G CUGUUACA 


1483 


TGTAACAG GGCTAGCTACAACGA TGCCTCCC 


4287 


994 


UCAAUAAA G UUACAGGA 


1484 


TCCTGTAA GGCTAGCTACAACGA TTTATTGA 


4288 


1023 


GGAUGUGA G UUUGUUCU 


1485 


AGAACAAA GGCTAGCTACAACGA TCACATCC 


4289 


1054 


CGGAGAAG G CUUCUAUA 


1486 


TATAGAAG GGCTAGCTACAACGA CTTCTCCG 


4290 


1090 


AUUCUAUA G UUGAAUUC 


1487 


GAATTCAA GGCTAGCTACAACGA TATAGAAT 


4291 


1126 


ACAAAGAA G CUCCAAAC 


1488 


GTTTGGAG GGCTAGCTACAACGA TTCTTTGT 


4292 


1137 


CCAAACAA G CAAAAUCA 


1489 


TGATTTTG GGCTAGCTACAACGA TTGTTTGG 


4293 


1163 


UCUCCGAA G CACAUGGG 


1490 


CCCATGTG GGCTAGCTACAACGA TTCGGAGA 


4294 


1174 


CAUGGGAA G UGAUCCGU 


1491 


ACGGATCA GGCTAGCTACAACGA TTCCCATG 


4295 


1181 


AGUGAUCC G UGAUUCUG 


1492 


CAGAATCA GGCTAGCTACAACGA GGATCACT 


4296 


1224 


ACAACACA G CCACCAAA 


1493 


TTTGGTGG GGCTAGCTACAACGA TGTGTTGT 


4297 


1279 


UGUGUUUA G UCCUUGAC 


1494 


GTCAAGGA GGCTAGCTACAACGA TAAACACA 


4298 


1298 


AUCUGGAA G CAUGGCGA 


1495 


TCGCCATG GGCTAGCTACAACGA TTCCAGAT 


4299 


1303 


GAAGCAUG G CGACUGGU 


1496 


ACCAGTCG GGCTAGCTACAACGA CATGCTTC 


4300 


1310 


GGCGACUG G UAACCGCC 


1497 


GGCGGTTA GGCTAGCTACAACGA CAGTCGCC 


4301 


1336 


UGAAUCAA G CAGGCCAG 


1498 


CTGGCCTG GGCTAGCTACAACGA TTGATTCA 


4302 
1340


UCAAGCAG 


G 


CCAGCUUU 


1499 


AAAGCTGG 


GGCTAGCTACAACGA 


CTGCTTGA 


4303 


1344 


GCAGGCCA 


G 


CUUUUCCU 


1500 


AGGAAAAG 


GGCTAGCTACAACGA 


TGGCCTGC 


4304 


1363 


UGCAGACA 


G 


UUGAGCUG 


1501 


CAGCTCAA 


GGCTAGCTACAACGA 


TGTCTGCA 


4305 


1368 


ACAGUUGA 


G 


CUGGGGUC 


1502 


GACCCCAG 


GGCTAGCTACAACGA 


TCAACTGT 


4306 


1374 


GAGCUGGG 


G 


UCCUGGGU 


1503 


ACCCAGGA 


GGCTAGCTACAACGA 


CCCAGCTC 


4307 


1381 


GGUCCUGG 


G 


UUGGGAUG 


1504 


CATCCCAA 


GGCTAGCTACAACGA 


CCAGGACC 


4308 


1390 


UUGGGAUG 


G 


UGACAUUU 


1505 


AAATGTCA 


GGCTAGCTACAACGA 


CATCCCAA 


4309 


1403 


AUUUGACA 


G 


UGCUGCCC 


1506 


GGGCAGCA 


GGCTAGCTACAACGA 


TGTCAAAT 


4310 


1421 


UGUACAAA 


G 


UGAACUCA 


1507 


TGAGTTCA 


GGCTAGCTACAACGA 


TTTGTACA 


4311 


1442 


GAUAAACA 


G 


UGGCAGUG 


1508 


CACTGCCA 


GGCTAGCTACAACGA 


TGTTTATC 


4312 


1445 


AAACAGUG 


G 


CAGUGACA 


1509 


TGTCACTG 


GGCTAGCTACAACGA 


CACTGTTT 


4313 


1448 


CAGUGGCA 


G 


UGACAGGG 


1510 


CCCTGTCA 


GGCTAGCTACAACGA 


TGCCACTG 


4314 


1483 


UACCUGCA 


G 


CAGCUUCA 


1511 


TGAAGCTG 


GGCTAGCTACAACGA 


TGCAGGTA 


4315 


1486 


CUGCAGCA 


G 


CUUCAGGA 


1512 


TCCTGAAG 


GGCTAGCTACAACGA 


TGCTGCAG 


4316 


1500 


GGAGGGAC 


G 


UCCAUCUG 


1513 


CAGATGGA 


GGCTAGCTACAACGA 


GTCCCTCC 


4317 


1511 


CAUCUGCA 


G 


CGGGCUUC 


1514 


GAAGCCCG 


GGCTAGCTACAACGA 


TGCAGATG 


4318 


1515 


UGCAGCGG 


G 


CUUCGAUC 


1515 


GATCGAAG 


GGCTAGCTACAACGA 


CCGCTGCA 


4319 


1525 


UUCGAUCG 


G 


CAUUUACU 


1516 


AGTAAATG 


GGCTAGCTACAACGA 


CGATCGAA 


4320 


1607 


CACUAUAA 


G 


UGGGUGCU 


1517 


AGCACCCA 


GGCTAGCTACAACGA 


TTATAGTG 


4321 


1611 


AUAAGUGG 


G 


UGCUUUAA 


1518 


TTAAAGCA 


GGCTAGCTACAACGA 


CCACTTAT 


4322 


1624 


UUAACGAG 


G 


UCAAACAA 


1519 


TTGTTTGA 


GGCTAGCTACAACGA 


CTCGTTAA 


4323 


1634 


CAAACAAA 


G 


UGGUGCCA 


1520 


TGGCACCA 


GGCTAGCTACAACGA 


TTTGTTTG 


4324 


1637 


ACAAAGUG 


G 


UGCCAUCA 


1521 


TGATGGCA 


GGCTAGCTACAACGA 


CACTTTGT 


4325 


1654 


UCCACACA 


G 


UCGCUUUG 


1522 


CAAAGCGA 


GGCTAGCTACAACGA 


TGTGTGGA 


4326 


1665 


GCUUUGGG 


G 


CCCUCUGC 


1523 


GCAGAGGG 


GGCTAGCTACAACGA 


CCCAAAGC 


4327 


1675 


CCUCUGCA 


G 


CUCAAGAA 


1524 


TTCTTGAG 


GGCTAGCTACAACGA 


TGCAGAGG 


4328 


1692 


CUAGAGGA 


G 


CUGUCCAA 


1525 


TTGGACAG 


GGCTAGCTACAACGA 


TCCTCTAG 


4329 


1712 


GACAGGAG 


G 


UUUACAGA 


1526 


TCTGTAAA 


GGCTAGCTACAACGA 


CTCCTGTC 


4330 


1738 


CAGAUCAA 


G 


UUCAGAAC 


1527 


GTTCTGAA 


GGCTAGCTACAACGA 


TTGATCTG 


4331 


1751 


GAACAAUG 


G 


CCUCAUUG 


1528 


CAATGAGG 


GGCTAGCTACAACGA 


CATTGTTC 


4332 


1771 


CUUUUGGG 


G 


CCCUUUCA 


1529 


TGAAAGGG 


GGCTAGCTACAACGA 


CCCAAAAG 


4333 


1792 


GAAAUGGA 


G 


CUGUCUCU 


1530 


AGAGACAG 


GGCTAGCTACAACGA 


TCCATTTC 


4334 


1803 


GUCUCUCA 


G 


CGCUCCAU 


1531 


ATGGAGCG 


GGCTAGCTACAACGA 


TGAGAGAC 


4335 


1815 


UCCAUCCA 


G 


CUUGAGAG 


1532 


CTCTCAAG 


GGCTAGCTACAACGA 


TGGATGGA 


4336 


1823 


GCUUGAGA 


G 


UAAGGGAU 


1533 


ATCCCTTA 


GGCTAGCTACAACGA 


TCTCAAGC 


4337 


1847 


CCAGAACA 


G 


CCAGUGGA 


1534 


TCCACTGG 


GGCTAGCTACAACGA 


TGTTCTGG 


4338 


1851 


AACAGCCA 


G 


UGGAUGAA 


1535 


TTCATCCA 


GGCTAGCTACAACGA 


TGGCTGTT 


4339 


1862 


GAUGAAUG 


G 


CACAGUGA 


1536 


TCACTGTG 


GGCTAGCTACAACGA 


CATTCATC 


4340 


1867 


AUGGCACA 


G 


UGAUCGUG 


1537 


CACGATCA 


GGCTAGCTACAACGA 


TGTGCCAT 


4341 


1873 


CAGUGAUC 


G 


UGGACAGC 


1538 


GCTGTCCA 


GGCTAGCTACAACGA 


GATCACTG 


4342 


1880 


CGUGGACA 


G 


CACCGUGG 


1539 


CCACGGTG 


GGCTAGCTACAACGA 


TGTCCACG 


4343 


1885 


ACAGCACC 


G 


UGGGAAAG 


1540 


CTTTCCCA 


GGCTAGCTACAACGA 


GGTGCTGT 


4344 


1926 


ACAACGCA 


G 


CCUCCCCA 


1541 


TGGGGAGG 


GGCTAGCTACAACGA 


TGCGTTGT 


4345 


1955 


GGAUCCCA 


G 


UGGACAGA 


1542 


TCTGTCCA 


GGCTAGCTACAACGA 


TGGGATCC 


4346 


1965 


GGACAGAA 


G 


CAAGGUGG 


1543 


CCACCTTG 


GGCTAGCTACAACGA 


TTCTGTCC 


4347 


1970 


GAAGCAAG 


G 


UGGCUUUG 


1544 


CAAAGCCA 


GGCTAGCTACAACGA 


CTTGCTTC 


4348 


1973 


GCAAGGUG 


G 


CUUUGUAG 


1545 


CTACAAAG 


GGCTAGCTACAACGA 


CACCTTGC 


4349 


1981 


GCUUUGUA 


G 


UGGACAAA 


1546 


TTTGTCCA 


GGCTAGCTACAACGA 


TACAAAGC 


4350 


2002 


CCAAAAUG 


G 


CCUACCUC 


1547 


GAGGTAGG 


GGCTAGCTACAACGA 


CATTTTGG 


4351 


2021 


AAUCCCAG 


G 


CAUUGCUA 


1548 


TAGCAATG 


GGCTAGCTACAACGA 


CTGGGATT 


4352 


2032 


UUGCUAAG 


G 


UUGGCACU 


1549 


AGTGCCAA 


GGCTAGCTACAACGA 


CTTAGCAA 


4353 


2036 


UAAGGUUG 


G 


CACUUGGA 


1550 


TCCAAGTG 


GGCTAGCTACAACGA 


CAACCTTA 


4354 


2051


GAAAUACA 


G 


UCUGCAAG 


1551 


CTTGCAGA 


GGCTAGCTACAACGA 


TGTATTTC 


4355 


2059 


GUCUGCAA 


G 


CAAGCUCA 


1552 


TGAGCTTG 


GGCTAGCTACAACGA 


TTGCAGAC 


4356 


2063 


GCAAGCAA 


G 


CUCACAAA 


1553 


TTTGTGAG 


GGCTAGCTACAACGA 


TTGCTTGC 


4357 


2091 


ACUGUCAC 


G 


UCCCGUGC 


1554 


GCACGGGA 


GGCTAGCTACAACGA 


GTGACAGT 


4358 



2096 


CACGUCCC 


G 


UGCGUCCA 


1555 


TGGACGCA 


GGCTAGCTACAACGA 


GGGACGTG 


4359 


2100 


UCCCGUGC 


G 


UCCAAUGC 


1556 


GCATTGGA 


GGCTAGCTACAACGA 


GCACGGGA 


4360 


2128 


CAAUUACA 


G 


UGACUUCC 


1557 


GGAAGTCA 


GGCTAGCTACAACGA 


TGTAATTG 


4361 


2156 


GGACACCA 


G 


CAAAUUCC 


1558 


GGAATTTG 


GGCTAGCTACAACGA 


TGGTGTCC 


4362 


2168 


AUUCCCCA 


G 


CCCUCUGG 


1559 


CCAGAGGG 


GGCTAGCTACAACGA 


TGGGGAAT 


4363 


2176 


GCCCUCUG 


G 


UAGUUUAU 


1560 


ATAAACTA 


GGCTAGCTACAACGA 


CAGAGGGC 


4364 


2179 


CUCUGGUA 


G 


UUUAUGCA 


1561 


TGCATAAA 


GGCTAGCTACAACGA 


TACCAGAG 


4365 


2203


GCCAAGGA 


G 


CCUCCCCA 


1562 


TGGGGAGG 


GGCTAGCTACAACGA 


TCCTTGGC 


4366 


2221 


UUCUCAGG


G 


CCAGUGUC 


1563 


GACACTGG 


GGCTAGCTACAACGA 


CCTGAGAA 


4367 


2225 


CAGGGCCA 


G 


UGUCACAG 


1564 


CTGTGACA 


GGCTAGCTACAACGA 


TGGCCCTG 


4368 


2233 


GUGUCACA 


G 


CCCUGAUU 


1565 


AATCAGGG 


GGCTAGCTACAACGA 


TGTGACAC 


4369 


2248 


UUGAAUCA 


G 


UGAAUGGA 


1566 


TCCATTCA 


GGCTAGCTACAACGA 


TGATTCAA 


4370 


2263 


GAAAAACA 


G 


UUACCUUG 


1567 


CAAGGTAA 


GGCTAGCTACAACGA 


TGTTTTTC 


4371 


2290 


AUAAUGGA 


G 


CAGGUGCU 


1568 


AGCACCTG 


GGCTAGCTACAACGA 


TCCATTAT 


4372 


2294 


UGGAGCAG 


G 


UGCUGAUG 


1569 


CATCAGCA 


GGCTAGCTACAACGA 


CTGCTCCA 


4373 


2318 


GGAUGACG 


G 


UGUCUACU 


1570 


AGTAGACA 


GGCTAGCTACAACGA 


CGTCATCC 


4374 


2331 


UACUCAAG 


G 


UAUUUCAC 


1571 


GTGAAATA 


GGCTAGCTACAACGA 


CTTGAGTA 


4375 


2357 


CACGAAUG 


G 


UAGAUACA 


1572 


TGTATCTA 


GGCTAGCTACAACGA 


CATTCGTG 


4376 


2366 


UAGAUACA 


G 


UGUAAAAG 


1573 


CTTTTACA 


GGCTAGCTACAACGA 


TGTATCTA 


4377 


2374 


GUGUAAAA 


G 


UGCGGGCU 


1574 


AGCCCGCA 


GGCTAGCTACAACGA 


TTTTACAC 


4378 


2380 


AAGUGCGG 


G 


CUCUGGGA 


1575 


TCCCAGAG 


GGCTAGCTACAACGA 


CCGCACTT 


4379 


2392 


UGGGAGGA 


G 


UUAACGCA 


1576 


TGCGTTAA 


GGCTAGCTACAACGA 


TCCTCCCA 


4380 


2401 


UUAACGCA 


G 


CCAGACGG 


1577 


CCGTCTGG 


GGCTAGCTACAACGA 


TGCGTTAA 


4381 


2413 


GACGGAGA 


G 


UGAUACCC 


1578 


GGGTATCA 


GGCTAGCTACAACGA 


TCTCCGTC 


4382 


2424 


AUACCCCA 


G 


CAGAGUGG 


1579 


CCACTCTG 


GGCTAGCTACAACGA 


TGGGGTAT 


4383 


2429


CCAGCAGA 


G 


UGGAGCAC 


1580 


GTGCTCCA 


GGCTAGCTACAACGA 


TCTGCTGG 


4384 


2434 


AGAGUGGA 


G 


CACUGUAC 


1581 


GTACAGTG 


GGCTAGCTACAACGA 


TCCACTCT 


4385 


2450 


CAUACCUG 


G 


CUGGAUUG 


1582 


CAATCCAG 


GGCTAGCTACAACGA 


CAGGTATG 


4386 


2523 


CAACACAA 


G 


CAAGUGUG 


1583 


CACACTTG 


GGCTAGCTACAACGA 


TTGTGTTG 


4387 


2527 


ACAAGCAA 


G 


UGUGUUUC 


1584 


GAAACACA 


GGCTAGCTACAACGA 


TTGCTTGT 


4388 


2537 


GUGUUUCA 


G 


CAGAACAU 


1585 


ATGTTCTG 


GGCTAGCTACAACGA 


TGAAACAC 


4389 


2555 


CUCGGGAG 


G 


CUCAUUUG 


1586 


CAAATGAG 


GGCTAGCTACAACGA 


CTCCCGAG 


4390 


2566 


CAUUUGUG 


G 


CUUCUGAU 


1587 


ATCAGAAG 


GGCTAGCTACAACGA 


CACAAATG 


4391 


2612 


CCCACCUG 


G 


CCAAAUCA 


1588 


TGATTTGG 


GGCTAGCTACAACGA 


CAGGTGGG 


4392 


2632 


ACCUGAAG 


G 


CGGAAAUU 


1589 


AATTTCCG 


GGCTAGCTACAACGA 


CTTCAGGT 


4393 


2648 


UCACGGGG 


G 


CAGUCUCA 


1590 


TGAGACTG 


GGCTAGCTACAACGA 


CCCCGTGA 


4394 


2651


CGGGGGCA 


G 


UCUCAUUA 


1591 


TAATGAGA 


GGCTAGCTACAACGA 


TGCCCCCG 


4395 


2674


CUUGGACA 


G 


CUCCUGGG 


1592 


CCCAGGAG 


GGCTAGCTACAACGA 


TGTCCAAG 


4396 


2704


AUGGAACA 


G 


CUCACAAG 


1593 


CTTGTGAG 


GGCTAGCTACAACGA 


TGTTCCAT 


4397 


2712 


GCUCACAA 


G 


UAUAUCAU 


1594 


ATGATATA 


GGCTAGCTACAACGA 


TTGTGAGC 


4398 


2729 


UCGAAUAA 


G 


UACAAGUA 


1595 


TACTTGTA 


GGCTAGCTACAACGA 


TTATTCGA 


4399 


2735 


AAGUACAA 


G 


UAUUCUUG 


1596 


CAAGAATA 


GGCTAGCTACAACGA 


TTGTACTT 


4400 


2757 


AGAGACAA 


G 


UUCAAUGA 


1597 


TCATTGAA 


GGCTAGCTACAACGA 


TTGTCTCT 


4401 


2776 


CUCUUCAA 


G 


UGAAUACU 


1598 


AGTATTCA 


GGCTAGCTACAACGA 


TTGAAGAG 


4402 


2806 


CAAAGGAA 


G 


CCAACUCU 


1599 


AGAGTTGG 


GGCTAGCTACAACGA 


TTCCTTTG 


4403 


2821 


CUGAGGAA 


G 


UCUUUUUG 


1600 


CAAAAAGA 


GGCTAGCTACAACGA 


TTCCTCAG 


4404 


2861 


UGAAAAUG 


G 


CACAGAUC 


1601 


GATCTGTG 


GGCTAGCTACAACGA 


CATTTTCA 


4405 


2887 


CUAUUCAG 


G 


CUGUUGAU 


1602 


ATCAACAG 


GGCTAGCTACAACGA 


CTGAATAG 


4406 


2899


UUGAUAAG 


G 


UCGAUCUG 


1603 


CAGATCGA 


GGCTAGCTACAACGA 


CTTATCAA 


4407 


2935


UUGCACGA 


G 


UAUCUUUG 


1604 


CAAAGATA 


GGCTAGCTACAACGA 


TCGTGCAA 


4408 


2978 


GACACCUA 


G 


UCCUGAUG 


1605 


CATCAGGA 


GGCTAGCTACAACGA 


TAGGTGTC 


4409 


2991


GAUGAAAC 


G 


UCUGCUCC 


1606 


GGAGCAGA 


GGCTAGCTACAACGA 


GTTTCATC 


4410 


3023


UAUCAACA


G


CACCAUUC


1607


GAATGGTG


GGCTAGCTACAACGA


TGTTGATA 


4411 


3035 


CAUUCCUG 


G 


CAUUCACA 


1608 


TGTGAATG 


GGCTAGCTACAACGA 


CAGGAATG 


4412 


3063 


AUGUGGAA 


G 


UGGAUAGG 


1609 


CCTATCCA 


GGCTAGCTACAACGA 


TTCCACAT 


4413 


3081 


GAACUGCA 


G 


CUGUCAAU 


1610 


ATTGACAG 


GGCTAGCTACAACGA 


TGCAGTTC 


4414 



3091 


UGUCAAUA 


G 


CCUAGGGC 


1611 


GCCCTAGG 


GGCTAGCTACAACGA 


TATTGACA 


4415 


3098 


AGCCUAGG 


G 


CUGAAUUU 


1612 


AAATTCAG 


GGCTAGCTACAACGA 


CCTAGGCT 


4416 


3189 


UGUAGGGG 


G 


CGAUAUAC 


1613 


GTATATCG 


GGCTAGCTACAACGA 


CCCCTACA 


4417 


3242 


UGUAGGGG 


G 


CGAUAUAC 


1613 


GTATATCG 


GGCTAGCTACAACGA 


CCCCTACA 


4417 


3210 


UGUAUAUA 


G 


UACAUUUA 


1614 


TAAATGTA 


GGCTAGCTACAACGA 


TATATACA 


4418 


3279 


UGUAGGGG 


G 


CGAUAAAA 


1615 


TTTTATCG 


GGCTAGCTACAACGA 


CCCCTACA 


4419 


21 


UGGUACAA 


A 


UGGAUGUG 


1616 


CACATCCA 


GGCTAGCTACAACGA 


TTGTACCA 


4420 


25 


ACAAAUGG 


A 


UGUGGAAU 


1617 


ATTCCACA 


GGCTAGCTACAACGA 


CCATTTGT 


4421 


32 


GAUGUGGA 


A 


UAUAAUUG 


1618 


CAATTATA 


GGCTAGCTACAACGA 


TCCACATC 


4422 


37 


GGAAUAUA 


A 


UUGAAUAU 


1619 


ATATTCAA 


GGCTAGCTACAACGA 


TATATTCC 


4423 


42 


AUAAUUGA 


A 


UAUUUUCU 


1620 


AGAAAATA 


GGCTAGCTACAACGA 


TCAATTAT 


4424 


114 


GAAGGCAG 


A 


UGGAAAUA 


1621 


TATTTCCA 


GGCTAGCTACAACGA 


CTGCCTTC 


4425 


120 


AGAUGGAA 


A 


UAUUUACA 


1622 


TGTAAATA 


GGCTAGCTACAACGA 


TTCCATCT 


4426 


137 


AGUACGCA 


A 


UUUGAGAC 


1623 


GTCTCAAA 


GGCTAGCTACAACGA 


TGCGTACT 


4427 


144 


AAUUUGAG 


A 


CUAAGAUA 


1624 


TATCTTAG 


GGCTAGCTACAACGA 


CTCAAATT 


4428 


150 


AGACUAAG 


A 


UAUUGUUA 


1625 


TAACAATA 


GGCTAGCTACAACGA 


CTTAGTCT 


4429 


176 


UAUUGAAG 


A 


CAAGAGCA 


1626 


TGCTCTTG 


GGCTAGCTACAACGA 


CTTCAATA 


4430 


185 


CAAGAGCA 


A 


UAGUAAAA 


1627 


TTTTACTA 


GGCTAGCTACAACGA 


TGCTCTTG 


4431 


193 


AUAGUAAA 


A 


CACAUCAG 


1628 


CTGATGTG 


GGCTAGCTACAACGA 


TTTACTAT 


4432 


217 


GGUUAAAG 


A 


CCUGUGAU 


1629 


ATCACAGG 


GGCTAGCTACAACGA 


CTTTAACC 


4433 


224 


GACCUGUG 


A 


UAAACCAC 


1630 


GTGGTTTA 


GGCTAGCTACAACGA 


CACAGGTC 


4434 


228 


UGUGAUAA 


A 


CCACUUCC 


1631 


GGAAGTGG 


GGCTAGCTACAACGA 


TTATCACA 


4435 


238 


CACUUCCG 


A 


UAAGUUGG 


1632 


CCAACTTA 


GGCTAGCTACAACGA 


CGGAAGTG 


4436 


249 


AGUUGGAA 


A 


CGUGUGUC 


1633 


GACACACG 


GGCTAGCTACAACGA 


TTCCAACT 


4437 


284 


UAUAUAUA 


A 


UGGUAAAG 


1634 


CTTTACCA 


GGCTAGCTACAACGA 


TATATATA 


4438 


297 


AAAGAAAG 


A 


CACCUUCG 


1635 


CGAAGGTG 


GGCTAGCTACAACGA 


CTTTCTTT 


4439 


308 


CCUUCGUA 


A 


CCCGCAUU 


1636 


AATGCGGG 


GGCTAGCTACAACGA 


TACGAAGG 


4440 


331 


AGAGAGGA 


A 


UCACAGGG 


1637 


CCCTGTGA 


GGCTAGCTACAACGA 


TCCTCTCT 


4441 


342 


ACAGGGAG 


A 


UGUACAGC 


1638 


GCTGTACA 


GGCTAGCTACAACGA 


CTCCCTGT 


4442 


352 


GUACAGCA 


A 


UGGGGCCA 


1639 


TGGCCCCA 


GGCTAGCTACAACGA 


TGCTGTAC 


4443 


385 


UCAUCUUG 


A 


UUCUUCAC 


1640 


GTGAAGAA 


GGCTAGCTACAACGA 


CAAGATGA 


4444 


416 


CCUGAGUA 


A 


UUCACUCA 


1641 


TGAGTGAA 


GGCTAGCTACAACGA 


TACTCAGG 


4445 


434 


UCAGCUGA 


A 


CAACAAUG 


1642 


CATTGTTG 


GGCTAGCTACAACGA 


TCAGCTGA 


4446 


437 


GCUGAACA 


A 


CAAUGGCU 


1643 


AGCCATTG 


GGCTAGCTACAACGA 


TGTTCAGC 


4447 


440 


GAACAACA 


A 


UGGCUAUG 


1644 


CATAGCCA 


GGCTAGCTACAACGA 


TGTTGTTC 


4448 


466 


UCGUUGCA 


A 


UCGACCCC 


1645 


GGGGTCGA 


GGCTAGCTACAACGA 


TGCAACGA 


4449 


470 


UGCAAUCG 


A 


CCCCAAUG 


1646 


CATTGGGG 


GGCTAGCTACAACGA 


CGATTGCA 


4450 


476 


CGACCCCA 


A 


UGUGCCAG 


1647 


CTGGCACA 


GGCTAGCTACAACGA 


TGGGGTCG 


4451 


488 


GCCAGAAG 


A 


UGAAACAC 


1648 


GTGTTTCA 


GGCTAGCTACAACGA 


CTTCTGGC 


4452 


493 


AAGAUGAA 


A 


CACUCAUU 


1649 


AATGAGTG 


GGCTAGCTACAACGA 


TTCATCTT 


4453 


504 


CUCAUUCA 


A 


CAAAUAAA 


1650 


TTTATTTG 


GGCTAGCTACAACGA 


TGAATGAG 


4454 


508 


UUCAACAA 


A 


UAAAGGAC 


1651 


GTCCTTTA 


GGCTAGCTACAACGA 


TTGTTGAA 


4455 


515 


AAUAAAGG 


A 


CAUGGUGA 


1652 


TCACCATG 


GGCTAGCTACAACGA 


CCTTTATT 


4456 


523 


ACAUGGUG 


A 


CCCAGGCA 


1653 


TGCCTGGG 


GGCTAGCTACAACGA 


CACCATGT 


4457 


564 


GGAAAGCG 


A 


UUUUAUUU 


1654 


AAATAAAA 


GGCTAGCTACAACGA 


CGCTTTCC 


4458 


578 


UUUCAAAA 


A 


UGUUGCCA 


1655 


TGGCAACA 


GGCTAGCTACAACGA 


TTTTGAAA 


4459 


592 


CCAUUUUG 


A 


UUCCUGAA 


1656 


TTCAGGAA 


GGCTAGCTACAACGA 


CAAAATGG 


4460 


601 


UUCCUGAA 


A 


CAUGGAAG 


1657 


CTTCCATG 


GGCTAGCTACAACGA 


TTCAGGAA 


4461 


610 


CAUGGAAG 


A 


CAAAGGCU 


1658 


AGCCTTTG 


GGCTAGCTACAACGA 


CTTCCATG 


4462 


620 


AAAGGCUG 


A 


CUAUGUGA 


1659 


TCACATAG 


GGCTAGCTACAACGA 


CAGCCTTT 


4463 


630 


UAUGUGAG 


A 


CCAAAACU 


1660 


AGTTTTGG 


GGCTAGCTACAACGA 


CTCACATA 


4464 


636 


AGACCAAA


A 


CUUGAGAC 


1661 


GTCTCAAG 


GGCTAGCTACAACGA 


TTTGGTCT 


4465 


643 


AACUUGAG 


A 


CCUACAAA 


1662 


TTTGTAGG 


GGCTAGCTACAACGA 


CTCAAGTT 


4466 


653 


CUACAAAA 


A 


UGCUGAUG 


1663 


CATCAGCA 


GGCTAGCTACAACGA 


TTTTGTAG 


4467 


659 


AAAUGCUG 


A 


UGUUCUGG 


1664 


CCAGAACA 


GGCTAGCTACAACGA 


CAGCATTT 


4468 


692 


UCCAGGUA 


A 


UGAUGAAC 


1665 


GTTCATCA 


GGCTAGCTACAACGA 


TACCTGGA 


4469 



695 


AGGUAAUG 


A 


UGAACCCU 


1666 


AGGGTTCA 


GGCTAGCTACAACGA 


CATTACCT 


4470 


699 


AAUGAUGA 


A 


CCCUACAC 


1667 


GTGTAGGG 


GGCTAGCTACAACGA 


TCATCATT 


4471 


715 


CUGAGCAG 


A 


UGGGCAAC 


1668 


GTTGCCCA 


GGCTAGCTACAACGA 


CTGCTCAG 


4472 


722 


GAUGGGCA 


A 


CUGUGGAG 


1669 


CTCCACAG 


GGCTAGCTACAACGA 


TGCCCATC 


4473 


745 


GUGAAAGG 


A 


UCCACCUC 


1670 


GAGGTGGA 


GGCTAGCTACAACGA 


CCTTTCAC 


4474 


761 


CACUCCUG 


A 


UUUCAUUG 


1671 


CAATGAAA 


GGCTAGCTACAACGA 


CAGGAGTG 


4475 


789 


UUAGCUGA 


A 


UAUGGACC 


1672 


GGTCCATA 


GGCTAGCTACAACGA 


TCAGCTAA 


4476 


795 


GAAUAUGG 


A 


CCACAAGG 


1673 


CCTTGTGG 


GGCTAGCTACAACGA 


CCATATTC 


4477 


837 


CAUCUACG 


A 


UGGGGAGU 


1674 


ACTCCCCA 


GGCTAGCTACAACGA 


CGTAGATG 


4478 


851 


AGUAUUUG 


A 


CGAGUACA 


1675 


TGTACTCG 


GGCTAGCTACAACGA 


CAAATACT 


4479 


860 


CGAGUACA 


A 


UAAUGAUG 


1676 


CATCATTA 


GGCTAGCTACAACGA 


TGTACTCG 


4480 


863 


GUACAAUA 


A 


UGAUGAGA 


1677 


TCTCATCA 


GGCTAGCTACAACGA 


TATTGTAC 


4481 


866 


CAAUAAUG 


A 


UGAGAAAU 


1678 


ATTTCTCA 


GGCTAGCTACAACGA 


CATTATTG 


4482 


873 


GAUGAGAA 


A 


UUCUACUU 


1679 


AAGTAGAA 


GGCTAGCTACAACGA 


TTCTCATC 


4483 


887 


CUUAUCCA 


A 


UGGAAGAA 


1680 


TTCTTCCA 


GGCTAGCTACAACGA 


TGGATAAG 


4484 


895 


AUGGAAGA 


A 


UACAAGCA 


1681 


TGCTTGTA 


GGCTAGCTACAACGA 


TCTTCCAT 


4485 


909 


GCAGUAAG 


A 


UGUUCAGC 


1682 


GCTGAACA 


GGCTAGCTACAACGA 


CTTACTGC 


4486 


935 


UGGUACAA 


A 


UGUAGUAA 


1683 


TTACTACA 


GGCTAGCTACAACGA 


TTGTACCA 


4487 


978 


ACCAAAAG 


A 


UGCACAUU 


1684 


AATGTGCA 


GGCTAGCTACAACGA 


CTTTTGGT 


4488 


989 


CACAUUCA 


A 


UAAAGUUA 


1685 


TAACTTTA 


GGCTAGCTACAACGA 


TGAATGTG 


4489 


1002 


GUUACAGG 


A 


CUCUAUGA 


1686 


TCATAGAG 


GGCTAGCTACAACGA 


CCTGTAAC 


4490 


1017 


GAAAAAGG 


A 


UGUGAGUU 


1687 


AACTCACA 


GGCTAGCTACAACGA 


CCTTTTTC 


4491 


1035 


GUUCUCCA 


A 


UCCCGCCA 


1688 


TGGCGGGA 


GGCTAGCTACAACGA 


TGGAGAAC 


4492 


1045 


CCCGCCAG 


A 


CGGAGAAG 


1689 


CTTCTCCG 


GGCTAGCTACAACGA 


CTGGCGGG 


4493 


1063 


CUUCUAUA 


A 


UGUUUGCA 


1690 


TGCAAACA 


GGCTAGCTACAACGA 


TATAGAAG 


4494 


1074 


UUUGCACA 


A 


CAUGUUGA 


1691 


TCAACATG 


GGCTAGCTACAACGA 


TGTGCAAA 


4495 


1082 


ACAUGUUG 


A 


UUCUAUAG 


1692 


CTATAGAA 


GGCTAGCTACAACGA 


CAACATGT 


4496 


1095 


AUAGUUGA 


A 


UUCUGUAC 


1693 


GTACAGAA 


GGCTAGCTACAACGA 


TCAACTAT 


4497 


1107 


UGUACAGA 


A 


CAAAACCA 


1694 


TGGTTTTG 


GGCTAGCTACAACGA 


TCTGTACA 


4498 


1112 


AGAACAAA 


A 


CCACAACA 


1695 


TGTTGTGG 


GGCTAGCTACAACGA 


TTTGTTCT 


4499 


1118 


AAACCACA 


A 


CAAAGAAG 


1696 


CTTCTTTG 


GGCTAGCTACAACGA 


TGTGGTTT 


4500 


1133 


AGCUCCAA 


A 


CAAGCAAA 


1697 


TTTGCTTG 


GGCTAGCTACAACGA 


TTGGAGCT 


4501 


1142 


CAAGCAAA 


A 


UCAAAAAU 


1698 


ATTTTTGA 


GGCTAGCTACAACGA 


TTTGCTTG 


4502 


1149 


AAUCAAAA 


A 


UGCAAUCU 


1699 


AGATTGCA 


GGCTAGCTACAACGA 


TTTTGATT 


4503 


1154 


AAAAUGCA 


A 


UCUCCGAA 


1700 


TTCGGAGA 


GGCTAGCTACAACGA 


TGCATTTT 


4504 


1177 


GGGAAGUG 


A 


UCCGUGAU 


1701 


ATCACGGA 


GGCTAGCTACAACGA 


CACTTCCC 


4505 


1184 


GAUCCGUG 


A 


UUCUGAGG 


1702 


CCTCAGAA 


GGCTAGCTACAACGA 


CACGGATC 


4506 


1193 


UUCUGAGG 


A 


CUUUAAGA 


1703 


TCTTAAAG 


GGCTAGCTACAACGA 


CCTCAGAA 


4507 


1204 


UUAAGAAA 


A 


CCACUCCU 


1704 


AGGAGTGG 


GGCTAGCTACAACGA 


TTTCTTAA 


4508 


1216 


CUCCUAUG 


A 


CAACACAG 


1705 


CTGTGTTG 


GGCTAGCTACAACGA 


CATAGGAG 


4509 


1219 


CUAUGACA 


A 


CACAGCCA 


1706 


TGGCTGTG 


GGCTAGCTACAACGA 


TGTCATAG 


4510 


1232 


GCCACCAA 


A 


UCCCACCU 


1707 


AGGTGGGA 


GGCTAGCTACAACGA 


TTGGTGGC 


4511 


1255 


UGCUGCAG 


A 


UUGGACAA 


1708 


TTGTCCAA 


GGCTAGCTACAACGA 


CTGCAGCA 


4512 


1260 


CAGAUUGG 


A 


CAAAGAAU 


1709 


ATTCTTTG 


GGCTAGCTACAACGA 


CCAATCTG 


4513 


1267 


GACAAAGA 


A 


UUGUGUGU 


1710 


ACACACAA 


GGCTAGCTACAACGA 


TCTTTGTC 


4514 


1286 


AGUCCUUG 


A 


CAAAUCUG 


1711 


CAGATTTG 


GGCTAGCTACAACGA 


CAAGGACT 


4515 


1290 


CUUGACAA 


A 


UCUGGAAG 


1712 


CTTCCAGA 


GGCTAGCTACAACGA 


TTGTCAAG 


4516 


1306 


GCAUGGCG 


A 


CUGGUAAC 


1713 


GTTACCAG 


GGCTAGCTACAACGA 


CGCCATGC 


4517 


1313 


GACUGGUA 


A 


CCGCCUCA 


1714 


TGAGGCGG 


GGCTAGCTACAACGA 


TACCAGTC 


4518 


1322 


CCGCCUCA 


A 


UCGACUGA 


1715 


TCAGTCGA 


GGCTAGCTACAACGA 


TGAGGCGG 


4519 


1326 


CUCAAUCG 


A 


CUGAAUCA 


1716 


TGATTCAG 


GGCTAGCTACAACGA 


CGATTGAG 


4520 


1331 


UCGACUGA 


A 


UCAAGCAG 


1717 


CTGCTTGA 


GGCTAGCTACAACGA 


TCAGTCGA 


4521 


1360 


UGCUGCAG 


A 


CAGUUGAG 


1718 


CTCAACTG 


GGCTAGCTACAACGA 


CTGCAGCA 


4522 


1387 


GGGUUGGG 


A 


UGGUGACA 


1719 


TGTCACCA 


GGCTAGCTACAACGA 


CCCAACCC 


4523 


1393 


GGAUGGUG 


A 


CAUUUGAC 


1720 


GTCAAATG 


GGCTAGCTACAACGA 


CACCATCC 


4524 


1400 


GACAUUUG 


A 


CAGUGCUG 


1721 


CAGCACTG 


GGCTAGCTACAACGA 


CAAATGTC 


4525 



1425 


CAAAGUGA 


A 


CUCAUACA 


1722 


TGTATGAG 


GGCTAGCTACAACGA 


TCACTTTG 


4526 


1435 


UCAUACAG 


A 


UAAACAGU 


1723 


ACTGTTTA 


GGCTAGCTACAACGA 


CTGTATGA 


4527 


1439 


ACAGAUAA 


A 


CAGUGGCA 


1724 


TGCCACTG 


GGCTAGCTACAACGA 


TTATCTGT 


4528 


1451 


UGGCAGUG 


A 


CAGGGACA 


1725 


TGTCCCTG 


GGCTAGCTACAACGA 


CACTGCCA 


4529 


1457 


UGACAGGG 


A 


CACACUCG 


1726 


CGAGTGTG 


GGCTAGCTACAACGA 


CCCTGTCA 


4530 


1473 


GCCAAAAG 


A 


UUACCUGC 


1727 


GCAGGTAA 


GGCTAGCTACAACGA 


CTTTTGGC 


4531 


1498 


CAGGAGGG 


A 


CGUCCAUC 


1728 


GATGGACG 


GGCTAGCTACAACGA 


CCCTCCTG 


4532 


1521 


GGGCUUCG 


A 


UCGGCAUU 


1729 


AATGCCGA 


GGCTAGCTACAACGA 


CGAAGCCC 


4533 


1537 


UUACUGUG 


A 


UUAGGAAG 


1730 


CTTCCTAA 


GGCTAGCTACAACGA 


CACAGTAA 


4534 


1548 


AGGAAGAA 


A 


UAUCCAAC 


1731 


GTTGGATA 


GGCTAGCTACAACGA 


TTCTTCCT 


4535 


1555 


AAUAUCCA 


A 


CUGAUGGA 


1732 


TCCATCAG 


GGCTAGCTACAACGA 


TGGATATT 


4536 


1559 


UCCAACUG 


A 


UGGAUCUG 


1733 


CAGATCCA 


GGCTAGCTACAACGA 


CAGTTGGA 


4537 


1563 


ACUGAUGG 


A 


UCUGAAAU 


1734 


ATTTCAGA 


GGCTAGCTACAACGA 


CCATCAGT 


4538 


1570 


GAUCUGAA 


A 


UUGUGCUG 


1735 


CAGCACAA 


GGCTAGCTACAACGA 


TTCAGATC 


4539 


1582 


UGCUGCUG 


A 


CGGAUGGG 


1736 


CCCATCCG 


GGCTAGCTACAACGA 


CAGCAGCA 


4540 


1586 


GCUGACGG 


A 


UGGGGAAG 


1737 


CTTCCCCA 


GGCTAGCTACAACGA 


CCGTCAGC 


4541 


1595 


UGGGGAAG 


A 


CAACACUA 


1738 


TAGTGTTG 


GGCTAGCTACAACGA 


CTTCCCCA 


4542 


1598 


GGAAGACA 


A 


CACUAUAA 


1739 


TTATAGTG 


GGCTAGCTACAACGA 


TGTCTTCC 


4543 


1619 


GUGCUUUA 


A 


CGAGGUCA 


1740 


TGACCTCG 


GGCTAGCTACAACGA 


TAAAGCAC 


4544 


1629 


GAGGUCAA 


A 


CAAAGUGG 


1741 


CCACTTTG 


GGCTAGCTACAACGA 


TTGACCTC 


4545 


1683 


GCUCAAGA 


A 


CUAGAGGA 


1742 


TCCTCTAG 


GGCTAGCTACAACGA 


TCTTGAGC 


4546 


1702 


UGUCCAAA 


A 


UGACAGGA 


1743 


TCCTGTCA 


GGCTAGCTACAACGA 


TTTGGACA 


4547 


1705 


CCAAAAUG 


A 


CAGGAGGU 


1744 


ACCTCCTG 


GGCTAGCTACAACGA 


CATTTTGG 


4548 


1720 


GUUUACAG 


A 


CAUAUGCU 


1745 


AGCATATG 


GGCTAGCTACAACGA 


CTGTAAAC 


4549 


1733 


UGCUUCAG 


A 


UCAAGUUC 


1746 


GAACTTGA 


GGCTAGCTACAACGA 


CTGAAGCA 


4550 


1745 


AGUUCAGA 


A 


CAAUGGCC 


1747 


GGCCATTG 


GGCTAGCTACAACGA 


TCTGAACT 


4551 


1748 


UCAGAACA 


A 


UGGCCUCA 


1748 


TGAGGCCA 


GGCTAGCTACAACGA 


TGTTCTGA 


4552 


1760 


CCUCAUUG 


A 


UGCUUUUG 


1749 


CAAAAGCA 


GGCTAGCTACAACGA 


CAATGAGG 


4553 


1787 


AUCAGGAA 


A 


UGGAGCUG 


1750 


CAGCTCCA 


GGCTAGCTACAACGA 


TTCCTGAT 


4554 


1830 


AGUAAGGG 


A 


UUAACCCU 


1751 


AGGGTTAA 


GGCTAGCTACAACGA 


CCCTTACT 


4555 


1834 


AGGGAUUA 


A 


CCCUCCAG 


1752 


CTGGAGGG 


GGCTAGCTACAACGA 


TAATCCCT 


4556 


1844 


CCUCCAGA 


A 


CAGCCAGU 


1753 


ACTGGCTG 


GGCTAGCTACAACGA 


TCTGGAGG 


4557 


1855 


GCCAGUGG 


A 


UGAAUGGC 


1754 


GCCATTCA 


GGCTAGCTACAACGA 


CCACTGGC 


4558 


1859 


GUGGAUGA 


A 


UGGCACAG 


1755 


CTGTGCCA 


GGCTAGCTACAACGA 


TCATCCAC 


4559 


1870 


GCACAGUG 


A 


UCGUGGAC 


1756 


GTCCACGA 


GGCTAGCTACAACGA 


CACTGTGC 


4560 


1877 


GAUCGUGG 


A 


CAGCACCG 


1757 


CGGTGCTG 


GGCTAGCTACAACGA 


CCACGATC 


4561 


1895 


GGGAAAGG 


A 


CACUUUGU 


1758 


ACAAAGTG 


GGCTAGCTACAACGA 


CCTTTCCC 


4562 


1918 


UCACCUGG 


A 


CAACGCAG 


1759 


CTGCGTTG 


GGCTAGCTACAACGA 


CCAGGTGA 


4563 


1921 


CCUGGACA 


A 


CGCAGCCU 


1760 


AGGCTGCG 


GGCTAGCTACAACGA 


TGTCCAGG 


4564 


1936 


CUCCCCAA 


A 


UCCUUCUC 


1761 


GAGAAGGA 


GGCTAGCTACAACGA 


TTGGGGAG 


4565 


1949 


UCUCUGGG 


A 


UCCCAGUG 


1762 


CACTGGGA 


GGCTAGCTACAACGA 


CCCAGAGA 


4566 


1959 


CCCAGUGG 


A 


CAGAAGCA 


1763 


TGCTTCTG 


GGCTAGCTACAACGA 


CCACTGGG 


4567 


1985 


UGUAGUGG 


A 


CAAAAACA 


1764 


TGTTTTTG 


GGCTAGCTACAACGA 


CCACTACA 


4568 


1991 


GGACAAAA 


A 


CACCAAAA 


1765 


TTTTGGTG 


GGCTAGCTACAACGA 


TTTTGTCC 


4569 


1999 


ACACCAAA 


A 


UGGCCUAC 


1766 


GTAGGCCA 


GGCTAGCTACAACGA 


TTTGGTGT 


4570 


2014 


ACCUCCAA 


A 


UCCCAGGC 


1767 


GCCTGGGA 


GGCTAGCTACAACGA 


TTGGAGGT 


4571 


2046 


ACUUGGAA 


A 


UACAGUCU 


1768 


AGACTGTA 


GGCTAGCTACAACGA 


TTCCAAGT 


4572 


2071 


GCUCACAA 


A 


CCUUGACC 


1769 


GGTCAAGG 


GGCTAGCTACAACGA 


TTGTGAGC 


4573 


2077 


AAACCUUG 


A 


CCCUGACU 


1770 


AGTCAGGG 


GGCTAGCTACAACGA 


CAAGGTTT 


4574 


2083 


UGACCCUG 


A 


CUGUCACG 


1771 


CGTGACAG 


GGCTAGCTACAACGA 


CAGGGTCA 


4575 


2105 


UGCGUCCA 


A 


UGCUACCC 


1772 


GGGTAGCA 


GGCTAGCTACAACGA 


TGGACGCA 


4576 


2122 


UGCCUCCA 


A 


UUACAGUG 


1773 


CACTGTAA 


GGCTAGCTACAACGA 


TGGAGGCA 


4577 


2131 


UUACAGUG 


A 


CUUCCAAA 


1774 


TTTGGAAG 


GGCTAGCTACAACGA 


CACTGTAA 


4578 


2140 


CUUCCAAA 


A 


CGAACAAG 


1775 


CTTGTTCG 


GGCTAGCTACAACGA 


TTTGGAAG 


4579 


2144 


CAAAACGA 


A 


CAAGGACA 


1776 


TGTCCTTG 


GGCTAGCTACAACGA 


TCGTTTTG 


4580 


2150 


GAACAAGG 


A 


CACCAGCA 


1777 


TGCTGGTG 


GGCTAGCTACAACGA 


CCTTGTTC 


4581 
2160


ACCAGCAA 


A 


UUCCCCAG 


1778 


CTGGGGAA 


GGCTAGCTACAACGA 


TTGCTGGT 


4582 




2189 


UUAUGCAA 


A 


UAUUCGCC 


1779 


GGCGAATA 


GGCTAGCTACAACGA 


TTGCATAA 


4583 




2212 


CCUCCCCA 


A 


UUCUCAGG 


1780 


CCTGAGAA 


GGCTAGCTACAACGA 


TGGGGAGG 


4584 




2239 


CAGCCCUG 


A 


UUGAAUCA 


1781 


TGATTCAA 


GGCTAGCTACAACGA 


CAGGGCTG 


4585 




2244 


CUGAUUGA 


A 


UCAGUGAA 


1782 


TTCACTGA 


GGCTAGCTACAACGA 


TCAATCAG 


4586 




2252 


AUCAGUGA 


A 


UGGAAAAA 


1783 


TTTTTCCA 


GGCTAGCTACAACGA 


TCACTGAT 


4587 




2260 


AUGGAAAA 


A 


CAGUUACC 


1784 


GGTAACTG 


GGCTAGCTACAACGA 


TTTTCCAT 


4588 




2274 


ACCUUGGA 


A 


CUACUGGA 


1785 


TCCAGTAG 


GGCTAGCTACAACGA 


TCCAAGGT 


4589 




2282 


ACUACUGG 


A 


UAAUGGAG 


1786 


CTCCATTA 


GGCTAGCTACAACGA 


CCAGTAGT 


4590 




2285 


ACUGGAUA 


A 


UGGAGCAG 


1787 


CTGCTCCA 


GGCTAGCTACAACGA 


TATCCAGT 


4591 




2300 


AGGUGCUG 


A 


UGCUACUA 


1788 


TAGTAGCA 


GGCTAGCTACAACGA 


CAGCACCT 


4592 




2312 


UACUAAGG 


A 


UGACGGUG 


1789 


CACCGTCA 


GGCTAGCTACAACGA 


CCTTAGTA 


4593 




2315 


UAAGGAUG 


A 


CGGUGUCU 


1790 


AGACACCG 


GGCTAGCTACAACGA 


CATCCTTA 


4594 




2341 


AUUUCACA 


A 


CUUAUGAC 


1791 


GTCATAAG 


GGCTAGCTACAACGA 


TGTGAAAT 


4595 




2348 


AACUUAUG 


A 


CACGAAUG 


1792 


CATTCGTG 


GGCTAGCTACAACGA 


CATAAGTT 


4596 




2354 


UGACACGA 


A 


UGGUAGAU 


1793 


ATCTACCA 


GGCTAGCTACAACGA 


TCGTGTCA 


4597 




2361 


AAUGGUAG 


A 


UACAGUGU 


1794 


ACACTGTA 


GGCTAGCTACAACGA 


CTACCATT 


4598 




2396 


AGGAGUUA 


A 


CGCAGCCA 


1795 


TGGCTGCG 


GGCTAGCTACAACGA 


TAACTCCT 


4599 




2406 


GCAGCCAG 


A 


CGGAGAGU 


1796 


ACTCTCCG 


GGCTAGCTACAACGA 


CTGGCTGC 


4600 




2416 


GGAGAGUG 


A 


UACCCCAG 


1797 


CTGGGGTA 


GGCTAGCTACAACGA 


CACTCTCC 


4601 




2455 


CUGGCUGG 


A 


UUGAGAAU 


1798 


ATTCTCAA 


GGCTAGCTACAACGA 


CCAGCCAG 


4602 




2462 


GAUUGAGA 


A 


UGAUGAAA 


1799 


TTTCATCA 


GGCTAGCTACAACGA 


TCTCAATC 


4603 




2465 


UGAGAAUG 


A 


UGAAAUAC 


1800 


GTATTTCA 


GGCTAGCTACAACGA 


CATTCTCA 


4604 




2470 


AUGAUGAA 


A 


UACAAUGG 


1801 


CCATTGTA 


GGCTAGCTACAACGA 


TTCATCAT 


4605 




2475 


GAAAUACA 


A 


UGGAAUCC 


1802 


GGATTCCA 


GGCTAGCTACAACGA 


TGTATTTC 


4606 




2480 


ACAAUGGA 


A 


UCCACCAA 


1803 


TTGGTGGA 


GGCTAGCTACAACGA 


TCCATTGT 


4607 




2490 


CCACCAAG 


A 


CCUGAAAU 


1804 


ATTTCAGG 


GGCTAGCTACAACGA 


CTTGGTGG 


4608 




2497 


GACCUGAA 


A 


UUAAUAAG 


1805 


CTTATTAA 


GGCTAGCTACAACGA 


TTCAGGTC 


4609 




2501 


UGAAAUUA 


A 


UAAGGAUG 


1806 


CATCCTTA 


GGCTAGCTACAACGA 


TAATTTCA 


4610 




2507 


UAAUAAGG 


A 


UGAUGUUC 


1807 


GAACATCA 


GGCTAGCTACAACGA 


CCTTATTA 


4611 




2510 


UAAGGAUG 


A 


UGUUCAAC 


1808 


GTTGAACA 


GGCTAGCTACAACGA 


CATCCTTA 


4612 




2517 


GAUGUUCA 


A 


CACAAGCA 


1809 


TGCTTGTG 


GGCTAGCTACAACGA 


TGAACATC 


4613 




2542 


UCAGCAGA 


A 


CAUCCUCG 


1810 


CGAGGATG 


GGCTAGCTACAACGA 


TCTGCTGA 


4614 




2573 


GGCUUCUG 


A 


UGUCCCAA 


1811 


TTGGGACA 


GGCTAGCTACAACGA 


CAGAAGCC 


4615 




2582 


UGUCCCAA 


A 


UGCUCCCA 


1812 


TGGGAGCA 


GGCTAGCTACAACGA 


TTGGGACA 


4616 




2597 


CAUACCUG 


A 


UCUCUUCC 


1813 


GGAAGAGA 


GGCTAGCTACAACGA 


CAGGTATG 


4617 




2617 


CUGGCCAA 


A 


UCACCGAC 


1814 


GTCGGTGA 


GGCTAGCTACAACGA 


TTGGCCAG 


4618 




2624 


AAUCACCG 


A 


CCUGAAGG 


1815 


CCTTCAGG 


GGCTAGCTACAACGA 


CGGTGATT 


4619 




2638 


AGGCGGAA 


A 


UUCACGGG 


1816 


CCCGTGAA 


GGCTAGCTACAACGA 


TTCCGCCT 


4620 




2660 


UCUCAUUA 


A 


UCUGACUU 


1817 


AAGTCAGA 


GGCTAGCTACAACGA 


TAATGAGA 


4621 




2665 


UUAAUCUG 


A 


CUUGGACA 


1818 


TGTCCAAG 


GGCTAGCTACAACGA 


CAGATTAA 


4622 




2671 


UGACUUGG 


A 


CAGCUCCU 


1819 


AGGAGCTG 


GGCTAGCTACAACGA 


CCAAGTCA 


4623 




2684 


UCCUGGGG 


A 


UGAUUAUG 


1820 


CATAATCA 


GGCTAGCTACAACGA 


CCCCAGGA 


4624 




2687 


UGGGGAUG 


A 


UUAUGACC 


1821 


GGTCATAA 


GGCTAGCTACAACGA 


CATCCCCA 


4625 




2693 


UGAUUAUG 


A 


CCAUGGAA 


1822 


TTCCATGG 


GGCTAGCTACAACGA 


CATAATCA 


4626 




2701 


ACCAUGGA 


A 


CAGCUCAC 


1823 


GTGAGCTG 


GGCTAGCTACAACGA 


TCCATGGT 


4627 




2725 


UCAUUCGA 


A 


UAAGUACA 


1824 


TGTACTTA 


GGCTAGCTACAACGA 


TCGAATGA 


4628 




2744 


UAUUCUUG 


A 


UCUCAGAG 


1825 


CTCTGAGA 


GGCTAGCTACAACGA 


CAAGAATA 


4629 




2753 


UCUCAGAG 


A 


CAAGUUCA 


1826 


TGAACTTG 


GGCTAGCTACAACGA 


CTCTGAGA 


4630 




2762 


CAAGUUCA 


A 


UGAAUCUC 


1827 


GAGATTCA 


GGCTAGCTACAACGA 


TGAACTTG 


4631 




2766 


UUCAAUGA 


A 


UCUCUUCA 


1828 


TGAAGAGA 


GGCTAGCTACAACGA 


TCATTGAA 


4632 




2780 


UCAAGUGA 


A 


UACUACUG 


1829 


CAGTAGTA 


GGCTAGCTACAACGA 


TCACTTGA 


4633 




2810


GGAAGCCA 


A 


CUCUGAGG 


1830 


CCTCAGAG 


GGCTAGCTACAACGA 


TGGCTTCC 


4634 




2835 


UUGUUUAA 


A 


CCAGAAAA 


1831 


TTTTCTGG 


GGCTAGCTACAACGA 


TTAAACAA 


4635 




2843 


ACCAGAAA 


A 


CAUUACUU 


1832 


AAGTAATG 


GGCTAGCTACAACGA 


TTTCTGGT 


4636 




2858 


UUUUGAAA 


A 


UGGCACAG 


1833 


CTGTGCCA 


GGCTAGCTACAACGA 


TTTCAAAA 


4637 
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2867 


UGGCACAG A UCUUUUCA 


1834 


TGAAAAGA GGCTAGCTACAACGA CTGTGCCA 


4638 


2894    GGCUGUUG A UAAGGUCG    1835    CGACCTTA GGCTAGCTACAACGA CAACAGCC    4639
2903    UAAGGUCG A UCUGAAAU    1836    ATTTCAGA GGCTAGCTACAACGA CGACCTTA    4640
2910    GAUCUGAA A UCAGAAAU    1837    ATTTCTGA GGCTAGCTACAACGA TTCAGATC    4641
2917    AAUCAGAA A UAUCCAAC    1838    GTTGGATA GGCTAGCTACAACGA TTCTGATT    4642
2924    AAUAUCCA A CAUUGCAC    1839    GTGCAATG GGCTAGCTACAACGA TGGATATT    4643
2959    CUCCACAG A CUCCGCCA    1840    TGGCGGAG GGCTAGCTACAACGA CTGTGGAG    4644
2971    CGCCAGAG A CACCUAGU    1841    ACTAGGTG GGCTAGCTACAACGA CTCTGGCG    4645
2984    UAGUCCUG A UGAAACGU    1842    ACGTTTCA GGCTAGCTACAACGA CAGGACTA    4646
2989    CUGAUGAA A CGUCUGCU    1843    AGCAGACG GGCTAGCTACAACGA TTCATCAG    4647
3008    UUGUCCUA A UAUUCAUA    1844    TATGAATA GGCTAGCTACAACGA TAGGACAA    4648
3020    UCAUAUCA A CAGCACCA    1845    TGGTGCTG GGCTAGCTACAACGA TGATATGA    4649
3052    UUUUAAAA A UUAUGUGG    1846    CCACATAA GGCTAGCTACAACGA TTTTAAAA    4650
3067    GGAAGUGG A UAGGAGAA    1847    TTCTCCTA GGCTAGCTACAACGA CCACTTCC    4651
3075    AUAGGAGA A CUGCAGCU    1848    AGCTGCAG GGCTAGCTACAACGA TCTCCTAT    4652
3088    AGCUGUCA A UAGCCUAG    1849    CTAGGCTA GGCTAGCTACAACGA TGACAGCT    4653
3103    AGGGCUGA A UUUUUGUC    1850    GACAAAAA GGCTAGCTACAACGA TCAGCCCT    4654
3114    UUUGUCAG A UAAAUAAA    1851    TTTATTTA GGCTAGCTACAACGA CTGACAAA    4655
3118    UCAGAUAA A UAAAAUAA    1852    TTATTTTA GGCTAGCTACAACGA TTATCTGA    4656
3123    UAAAUAAA A UAAAUCAU    1853    ATGATTTA GGCTAGCTACAACGA TTTATTTA    4657
3127    UAAAAUAA A UCAUUCAU    1854    ATGAATGA GGCTAGCTACAACGA TTATTTTA    4658
3146    UUUUUUUG A UUAUAAAA    1855    TTTTATAA GGCTAGCTACAACGA CAAAAAAA    4659
3154    AUUAUAAA A UUUUCUAA    1856    TTAGAAAA GGCTAGCTACAACGA TTTATAAT    4660
3164    UUUCUAAA A UGUAUUUU    1857    AAAATACA GGCTAGCTACAACGA TTTAGAAA    4661
3175    UAUUUUAG A CUUCCUGU    1858    ACAGGAAG GGCTAGCTACAACGA CTAAAATA    4662
3265    UAUUUUAG A CUUCCUGU    1858    ACAGGAAG GGCTAGCTACAACGA CTAAAATA    4662
3192    AGGGGGCG A UAUACUAA    1859    TTAGTATA GGCTAGCTACAACGA CGCCCCCT    4663
3245    AGGGGGCG A UAUACUAA    1859    TTAGTATA GGCTAGCTACAACGA CGCCCCCT    4663
3201    UAUACUAA A UGUAUAUA    1860    TATATACA GGCTAGCTACAACGA TTAGTATA    4664
3225    UAUACUAA A UGUAUUCC    1861    GGAATACA GGCTAGCTACAACGA TTAGTATA    4665
3254    UAUACUAA A UGUAUUUU    1862    AAAATACA GGCTAGCTACAACGA TTAGTATA    4666
3282    AGGGGGCG A UAAAAUAA    1863    TTATTTTA GGCTAGCTACAACGA CGCCCCCT    4667
3287    GCGAUAAA A UAAAAUGC    1864    GCATTTTA GGCTAGCTACAACGA TTTATCGC    4668
3292    AAAAUAAA A UGCUAAAC    1865    GTTTAGCA GGCTAGCTACAACGA TTTATTTT    4669
3299    AAUGCUAA A CAACUGGG    1866    CCCAGTTG GGCTAGCTACAACGA TTAGCATT    4670
3302    GCUAAACA A CUGGGUAA    1867    TTACCCAG GGCTAGCTACAACGA TGTTTAGC    4671



Input Sequence = NM_001285. Cut Site = R/Y

Arm Length = 8. Core Sequence = GGCTAGCTACAACGA

NM_001285 (Homo sapiens chloride channel, calcium activated, 1 (CLCA1) mRNA, 3311 bp)
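
The parameters listed above (cut site R/Y, arm length 8, and the core sequence) are consistent with each DNAzyme entry in the table being built from its substrate entry: the DNA reverse complement of the eight nucleotides 3' of the unpaired purine, then the core, then the DNA reverse complement of the eight nucleotides 5' of it. The following Python sketch illustrates that relationship under this assumption. The function and variable names are hypothetical and do not come from the source, and the sketch is not a description of the software actually used to generate the table.

```python
# Illustrative sketch only; all names here are hypothetical, not from the source.
CORE = "GGCTAGCTACAACGA"  # core sequence given in the table footer

# Map each RNA base to its complementary DNA base.
RNA_TO_DNA_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}


def dna_reverse_complement(rna: str) -> str:
    """Return the DNA strand that base-pairs with an RNA segment, written 5'->3'."""
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(rna))


def is_ry_site(unpaired: str, next_base: str) -> bool:
    """True when the unpaired base is a purine (R) followed by a pyrimidine (Y)."""
    return unpaired in {"A", "G"} and next_base in {"C", "U"}


def design_dnazyme(substrate: str, core: str = CORE) -> str:
    """Build a DNAzyme from a substrate written as '5'-arm purine 3'-arm'."""
    five_arm, unpaired, three_arm = substrate.split()
    if not is_ry_site(unpaired, three_arm[0]):
        raise ValueError("substrate does not contain an R/Y cut site")
    # Assumed layout: the DNAzyme's 5' arm pairs with the substrate's 3' arm,
    # and its 3' arm pairs with the substrate's 5' arm.
    return " ".join(
        (dna_reverse_complement(three_arm), core, dna_reverse_complement(five_arm))
    )


print(design_dnazyme("GGCUUCUG A UGUCCCAA"))
# TTGGGACA GGCTAGCTACAACGA CAGAAGCC
```

The printed result matches the table entry at position 2573 (Seq ID Nos 1811 and 4615), which is the basis for the assumed layout.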
Table X: PCR Primers 249.021



PCR primer    Seq ID No
CGAAATCTCGAGCAGACTTGTGGGAGAAGCTC    5435
AGCACACTGCAGAGTTGCTGGCCAGCTTACCTCC    5436



